Seminar – Advanced Topics in Web Data Management

 

Tova Milo, Daniel Deutch, 2017/18

 

Meetings: Wednesdays 17-19

 


Seminar Information

 

The seminar focuses on managing, analyzing, sharing, and integrating data and applications on the web. Areas of interest include

crowdsourcing, data exploration, Big Data, probabilistic data and data provenance. We shall read recent

papers in this area, focusing on several specific issues, and then explore possible future directions. A tentative list of

papers is enclosed.

 

 

Schedule (Sem B)

 

 

March 18: Yuval, Slava, Yehonatan

 

April 8: Amir, Ariel

 

April 22: Tomer H., Nave

 

May 6: Shai, Amit

 

May 27: Shevach, Tomer W., Chai

 

June 10: Brit, Naama, Ori

 

 

Schedule (Sem A)

 

 

Nov 8

 

Slava+Eyal

 

Understanding Workers, Developing Effective Tasks, and Enhancing

 Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace

 

 

Nov 15: No meeting

 

Nov 22

 

Yuval

 

Effortless Data Exploration with zenvisage: An Expressive and

Interactive Visual Analytics System

 Tarique Ashraf Siddiqui     Albert Kim     John Lee     Karrie

 Karahalios     Aditya Parameswaran

 

 

 

Nov 29

 

Oded+ Amit

 

A Declarative Query Processing System for Nowcasting

 

 

Dec 6: No meeting

 

Dec 13:

 

Tomer W + Ori

 

KBQA: Learning Question Answering over QA Corpora and Knowledge Bases

 

 

Dec 20

 

Tomer H. + Yehonatan

 

I’ve Seen “Enough”: Incrementally Improving Visualizations to Support

Rapid Decision Making

 

 

 

Dec 27

 

Leonid Libkin + short group presentations

 

Jan 3: Amir + Nave

 

SILKMOTH: An Efficient Method for Finding Related Sets with Maximum Matching Constraints 

 

 

Jan 10

 

Chai+Shevach

 

CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning

 

 

Jan 17

 

Brit + Shay

 

Revisiting the Stop-and-Stare Algorithms for Influence Maximization

 

 

 

 

Papers

 

 

Data cleaning

 

 HoloClean: Holistic Data Repairs with Probabilistic Inference

 Theodoros Rekatsinas     Xu Chu     Ihab Ilyas     Chris Re

 

 CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning

 Stella Giannakopoulou     Manos Karpathiotakis     Benjamin

Gaidioz     Anastasia Ailamaki

 

 Knowledge Verification for LongTail Verticals - research Furong Li

 Xin Luna Dong     Anno Langen     Yang Li

 

 

(Social) Graphs and streams

 

 A Declarative Query Processing System for Nowcasting

 Dolan Antenucci     Michael Anderson     Michael Cafarella

 

 When Engagement Meets Similarity: Efficient (k,r)-Core Computation on

Social Networks

 Fan Zhang     Ying Zhang     Lu Qin     Wenjie Zhang     Xuemin Lin

 

 READS: A Random Walk Approach for Efficient and Accurate Dynamic SimRank

 Minhao Jiang     Ada Wai Chee Fu     Raymond Chi-Wing Wong     Ke Wang

 

 Revisiting the Stop-and-Stare Algorithms for Influence Maximization

 Keke Huang     Sibo Wang     Glenn Bevilacqua     Xiaokui Xiao

 Laks Lakshmanan

 

 

Approximate Query Processing

 

 Revisiting Reuse for Approximate Query Processing

 Alex Galakatos     Andrew Crotty     Emanuel Zgraggen     Carsten

 Binnig     Tim Kraska

 

 Probabilistic Database Summarization for Interactive Data Exploration

 Laurel Orr     Dan Suciu     Magdalena Balazinska

 

 Data Driven Approximation with Bounded Resources

 Yang Cao Wenfei Fan

 

 

Crowdsourcing

 

 Understanding Workers, Developing Effective Tasks, and Enhancing

 Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace

 Ayush Jain     Akash Das Sarma     Aditya Parameswaran Jennifer Widom

 

 Truth Inference in Crowdsourcing: Is the Problem Solved?

 Yudian Zheng     Guoliang Li     Yuanbing Li     Caihua Shan Reynold Cheng

 

 A Data Quality Metric (DQM): How to Estimate the Number of Undetected

Errors in Data Sets

 Yeounoh Chung     Sanjay Krishnan     Tim Kraska

 

 

Visualization and data exploration

 

 

 

 

 Effortless Data Exploration with zenvisage: An Expressive and

Interactive Visual Analytics System

 Tarique Ashraf Siddiqui     Albert Kim     John Lee     Karrie

 Karahalios     Aditya Parameswaran

 

 

Question answering