CSE6339

Hot Topics in Data and Information Management

class number: 24053

Spring 2008 (first day of class: Jan. 15th, 2008)
Tuesday/Thursday, 2:00 - 3:20pm, GACB 105



Syllabus      Announcements     Schedule     Resources     Project Requirements


Schedule

The tentative schedule is as follows.  We may change the schedule as necessary.

Paper Review:

Due at 11:59pm, two days before the scheduled paper is discussed. For example, R1 (the review of paper P1) is due at 11:59pm, Sunday, Feb. 3rd.

Paper Presentation Slides: (Only the presenter of the scheduled paper is required to submit the slides.)

Due at 11:59pm, two days before the scheduled paper is discussed. For example, S1 (the slides of paper P1) is due at 11:59pm, Sunday, Feb. 10th.

Date Paper# Lecture/Activities

Presenter

Due

Lecture Notes

Background Tutorial
01/15   Course Overview and Introduction Chengkai Li   [PDF] [PPT]
01/17   Review of Database Management Systems Chengkai Li   [PDF] [PPT]
01/22   Review of Data Mining Chengkai Li   [PDF] [PPT]
01/24   Review of Web Information Retrieval Chengkai Li   [PDF] [PPT]
01/29   Course Project Topics Chengkai Li   slides in WebCT
01/31   Paper Review, Presentation, Research Resources Chengkai Li   [PDF] [PPT]
Structured Querying of the Web
02/05 P1 Michael J. Cafarella, Christopher Re, Dan Suciu, Oren Etzioni: Structured Querying of Web Text Data: A Technical Challenge. CIDR 2007: 225-234   Group and Topic [PDF] [PPT]
02/07 P2 Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, Jeffrey F. Naughton: A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data. VLDB 2007: 1045-1056     [PDF]
02/12 P3 Tao Cheng, Xifeng Yan, Kevin Chen-Chuan Chang: EntityRank: Searching Entities Directly and Holistically. VLDB 2007: 387-398 Arjun Dasgupta   [PPT]
02/14 P4 Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, Zachary G. Ives: DBpedia: A Nucleus for a Web of Open Data. ISWC/ASWC 2007: 722-735 Rahul Dhar Proposal [PPT]
Retrieval, Exploration, and Navigation of Databases
02/19 P5 Yannis E. Ioannidis, Stratis Viglas: Conversational querying. Inf. Syst. 31(1): 33-56 (2006) Aniruddha Deshpande   [PPT]
02/21 P6 Cong Yu, H. V. Jagadish: Querying Complex Structured Databases. VLDB 2007: 1010-1021 Raghu Srinivasan   [PPT]
02/26 P7 Michalis Petropoulos, Alin Deutsch, Yannis Papakonstantinou: Interactive query formulation over web service-accessed sources. SIGMOD Conference 2006: 253-264
Shivkumar Chandrashekhar
  [PPT]
02/28 P8 Wisam Dakka, Panagiotis G. Ipeirotis, Kenneth R. Wood: Automatic construction of multifaceted browsing interfaces. CIKM 2005: 768-775 Raghu Srinivasan   [PPT]
03/04 P9 Chris Stolte, Diane Tang, Pat Hanrahan: Polaris: A System for Query, Analysis, and Visualization of Multidimensional Relational Databases. IEEE Trans. Vis. Comput. Graph. 8(1): 52-65 (2002) Robin Michael   [PDF]
03/06 P10 Zhiyuan Chen, Tao Li: Addressing diverse user preferences in SQL-query-result navigation. SIGMOD Conference 2007: 641-652
Shivkumar Chandrashekhar
  [PPT]
Social Network Analysis and Collaborative Filtering
03/11 P11 Lars Backstrom, Cynthia Dwork, Jon M. Kleinberg: Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography. WWW 2007: 181-190 Xin Jin   [PPT]
03/13 P12 Nilesh Bansal, Fei Chiang, Nick Koudas, Frank Wm. Tompa: Seeking Stable Clusters in the Blogosphere. VLDB 2007: 806-817
Naved Kazi
Progress Report 1 (due at March 16th)  
03/18

Happy spring break

03/20
03/25   Group 1 (Supreeth Chakravarthy, Aditya Telang) and 2 (Shivkumar Chandrashekhar, Aniruddha Deshpande)      
03/27   Group 3 (Arjun Dasgupta, Xin Jin, Raghu Srinivasan), 4 (Rahul Dhar, Robin Michael), and 5 (Naved Kazi, Muhammad Safiullah)      
04/01 P13 Nikolay Archak, Anindya Ghose, Panagiotis G. Ipeirotis: Show me the money!: deriving the pricing power of product features by mining consumer reviews. KDD 2007: 56-65 Arjun Dasgupta    
Systems and Architecture
04/03 P14 Abhinandan Das, Mayur Datar, Ashutosh Garg, ShyamSundar Rajaram: Google news personalization: scalable online collaborative filtering. WWW 2007: 271-280 Muhammad Safiullah Progress Report 2  
04/04 Last day to drop or withdraw
04/08 P15 Seung-Taek Park, David M. Pennock: Applying collaborative filtering techniques to movie search for better ranking and browsing. KDD 2007: 550-559 Muhammad Safiullah    
04/10 P16 Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay Jain, Luis Gravano: To search or to crawl?: towards a query optimizer for text-centric tasks. SIGMOD Conference 2006: 265-276 Robin Michael    
04/15 P17 Jayant Madhavan, Shirley Cohen, Xin Luna Dong, Alon Y. Halevy, Shawn R. Jeffery, David Ko, Cong Yu: Web-Scale Data Integration: You can afford to Pay as You Go. CIDR 2007:342-350 Naved Kazi    
04/17 P18 Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, Robert Gruber: Bigtable: A Distributed Storage System for Structured Data. OSDI 2006: 205-218 Supreeth Chakravarthy    
04/22 P19 Daniel J. Abadi, Adam Marcus, Samuel Madden, Katherine J. Hollenbach: Scalable Semantic Web Data Management Using Vertical Partitioning. VLDB 2007: 411-422 Aditya Telang Presentation and Demo Slides  
04/24   Group 1 (Supreeth Chakravarthy, Aditya Telang) and 2 (Shivkumar Chandrashekhar, Aniruddha Deshpande)      
04/29   Group 3 (Arjun Dasgupta, Xin Jin, Raghu Srinivasan) and 4 (Rahul Dhar, Robin Michael)      
05/01   Group 5 (Naved Kazi, Muhammad Safiullah) and Summary of course      
05/08   Final Report  

University calendar: Spring 2008