class number: 24053
Spring 2008 (first day of class: Jan. 15th,
2008)
Tuesday/Thursday, 2:00 - 3:20pm,
GACB 105
Syllabus Announcements Schedule Resources Project Requirements
The tentative schedule is as follows. We may change the schedule as necessary.
Paper Review:
Due at 11:59pm, two days before the scheduled paper is discussed. For example, R1 (the review of paper P1) is due at 11:59pm, Sunday, Feb. 3rd.
Paper Presentation Slides: (Only the presenter of the scheduled paper is required to submit the slides.)
Due at 11:59pm, two days before the scheduled paper is discussed. For example, S1 (the slides of paper P1) is due at 11:59pm, Sunday, Feb. 10th.
| Date | Paper# | Lecture/Activities |
Presenter |
Due |
Lecture Notes |
|
| Background Tutorial | ||||||
| 01/15 | Course Overview and Introduction | Chengkai Li | [PDF] [PPT] | |||
| 01/17 | Review of Database Management Systems | Chengkai Li | [PDF] [PPT] | |||
| 01/22 | Review of Data Mining | Chengkai Li | [PDF] [PPT] | |||
| 01/24 | Review of Web Information Retrieval | Chengkai Li | [PDF] [PPT] | |||
| 01/29 | Course Project Topics | Chengkai Li | slides in WebCT | |||
| 01/31 | Paper Review, Presentation, Research Resources | Chengkai Li | [PDF] [PPT] | |||
| Structured Querying of the Web | ||||||
| 02/05 | P1 | Michael J. Cafarella, Christopher Re, Dan Suciu, Oren Etzioni: Structured Querying of Web Text Data: A Technical Challenge. CIDR 2007: 225-234 | Group and Topic | [PDF] [PPT] | ||
| 02/07 | P2 | Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, Jeffrey F. Naughton: A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data. VLDB 2007: 1045-1056 | [PDF] | |||
| 02/12 | P3 | Tao Cheng, Xifeng Yan, Kevin Chen-Chuan Chang: EntityRank: Searching Entities Directly and Holistically. VLDB 2007: 387-398 | Arjun Dasgupta | [PPT] | ||
| 02/14 | P4 | Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, Zachary G. Ives: DBpedia: A Nucleus for a Web of Open Data. ISWC/ASWC 2007: 722-735 | Rahul Dhar | Proposal | [PPT] | |
| Retrieval, Exploration, and Navigation of Databases | ||||||
| 02/19 | P5 | Yannis E. Ioannidis, Stratis Viglas: Conversational querying. Inf. Syst. 31(1): 33-56 (2006) | Aniruddha Deshpande | [PPT] | ||
| 02/21 | P6 | Cong Yu, H. V. Jagadish: Querying Complex Structured Databases. VLDB 2007: 1010-1021 | Raghu Srinivasan | [PPT] | ||
| 02/26 | P7 | Michalis Petropoulos, Alin Deutsch, Yannis Papakonstantinou: Interactive query formulation over web service-accessed sources. SIGMOD Conference 2006: 253-264 |
Shivkumar Chandrashekhar |
[PPT] | ||
| 02/28 | P8 | Wisam Dakka, Panagiotis G. Ipeirotis, Kenneth R. Wood: Automatic construction of multifaceted browsing interfaces. CIKM 2005: 768-775 | Raghu Srinivasan | [PPT] | ||
| 03/04 | P9 | Chris Stolte, Diane Tang, Pat Hanrahan: Polaris: A System for Query, Analysis, and Visualization of Multidimensional Relational Databases. IEEE Trans. Vis. Comput. Graph. 8(1): 52-65 (2002) | Robin Michael | [PDF] | ||
| 03/06 | P10 | Zhiyuan Chen, Tao Li: Addressing diverse user preferences in SQL-query-result navigation. SIGMOD Conference 2007: 641-652 |
Shivkumar Chandrashekhar |
[PPT] | ||
| Social Network Analysis and Collaborative Filtering | ||||||
| 03/11 | P11 | Lars Backstrom, Cynthia Dwork, Jon M. Kleinberg: Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography. WWW 2007: 181-190 | Xin Jin | [PPT] | ||
| 03/13 | P12 | Nilesh Bansal, Fei Chiang, Nick Koudas, Frank Wm. Tompa: Seeking Stable Clusters in the Blogosphere. VLDB 2007: 806-817 |
Naved Kazi |
Progress Report 1 (due at March 16th) | ||
| 03/18 |
Happy spring break |
|||||
| 03/20 | ||||||
| 03/25 | Group 1 (Supreeth Chakravarthy, Aditya Telang) and 2 (Shivkumar Chandrashekhar, Aniruddha Deshpande) | |||||
| 03/27 | Group 3 (Arjun Dasgupta, Xin Jin, Raghu Srinivasan), 4 (Rahul Dhar, Robin Michael), and 5 (Naved Kazi, Muhammad Safiullah) | |||||
| 04/01 | P13 | Nikolay Archak, Anindya Ghose, Panagiotis G. Ipeirotis: Show me the money!: deriving the pricing power of product features by mining consumer reviews. KDD 2007: 56-65 | Arjun Dasgupta | |||
| Systems and Architecture | ||||||
| 04/03 | P14 | Abhinandan Das, Mayur Datar, Ashutosh Garg, ShyamSundar Rajaram: Google news personalization: scalable online collaborative filtering. WWW 2007: 271-280 | Muhammad Safiullah | Progress Report 2 | ||
| 04/04 | Last day to drop or withdraw | |||||
| 04/08 | P15 | Seung-Taek Park, David M. Pennock: Applying collaborative filtering techniques to movie search for better ranking and browsing. KDD 2007: 550-559 | Muhammad Safiullah | |||
| 04/10 | P16 | Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay Jain, Luis Gravano: To search or to crawl?: towards a query optimizer for text-centric tasks. SIGMOD Conference 2006: 265-276 | Robin Michael | |||
| 04/15 | P17 | Jayant Madhavan, Shirley Cohen, Xin Luna Dong, Alon Y. Halevy, Shawn R. Jeffery, David Ko, Cong Yu: Web-Scale Data Integration: You can afford to Pay as You Go. CIDR 2007:342-350 | Naved Kazi | |||
| 04/17 | P18 | Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, Robert Gruber: Bigtable: A Distributed Storage System for Structured Data. OSDI 2006: 205-218 | Supreeth Chakravarthy | |||
| 04/22 | P19 | Daniel J. Abadi, Adam Marcus, Samuel Madden, Katherine J. Hollenbach: Scalable Semantic Web Data Management Using Vertical Partitioning. VLDB 2007: 411-422 | Aditya Telang | Presentation and Demo Slides | ||
| 04/24 | Group 1 (Supreeth Chakravarthy, Aditya Telang) and 2 (Shivkumar Chandrashekhar, Aniruddha Deshpande) | |||||
| 04/29 | Group 3 (Arjun Dasgupta, Xin Jin, Raghu Srinivasan) and 4 (Rahul Dhar, Robin Michael) | |||||
| 05/01 | Group 5 (Naved Kazi, Muhammad Safiullah) and Summary of course | |||||
| 05/08 | Final Report | |||||
University calendar: Spring 2008