Seminar – Advanced Topics in Web Data
Management
Tova Milo,
Daniel Deutch, 2017/18
Meetings: Wednesdays 17-19
Seminar Information
The seminar focuses on managing, analyzing,
sharing, and integrating data and applications on the web. Areas of interest
include
crowdsourcing, data
exploration, Big Data, probabilistic data and data provenance. We shall read
recent
papers in this
area, focusing on several specific issues, and then explore possible future
directions. A tentative list of
papers is
enclosed.
Schedule (Sem B)
March 18: Yuval, Slava,
Yehonatan
April 8: Amir, Ariel
April 22: Tomer H.,
Nave
May 6: Shai, Amit
May 27: Shevach, Tomer W., Chai
June 10: Brit, Naama,
Ori
Schedule (Sem A)
Nov 8
Slava+Eyal
Understanding Workers, Developing Effective
Tasks, and Enhancing
Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace
Nov 15: No meeting
Nov 22
Yuval
Effortless Data Exploration with zenvisage: An Expressive and
Interactive Visual Analytics System
Tarique Ashraf Siddiqui Albert
Kim John Lee Karrie
Karahalios Aditya Parameswaran
Nov 29
Oded+ Amit
A Declarative Query Processing System for Nowcasting
Dec 6: No meeting
Dec 13:
Tomer W + Ori
KBQA: Learning Question Answering over QA
Corpora and Knowledge Bases
Dec 20
Tomer H. + Yehonatan
I’ve Seen “Enough”:
Incrementally Improving Visualizations to Support
Rapid Decision Making
Dec 27
Leonid Libkin + short
group presentations
Jan 3: Amir + Nave
SILKMOTH: An
Efficient Method for Finding Related Sets with Maximum Matching Constraints
Jan 10
Chai+Shevach
CleanM: An Optimizable
Query Language for Unified Scale-Out Data Cleaning
Jan 17
Brit + Shay
Revisiting the Stop-and-Stare Algorithms for
Influence Maximization
Papers
Data cleaning
HoloClean: Holistic Data Repairs with Probabilistic
Inference
Theodoros Rekatsinas Xu Chu Ihab Ilyas Chris Re
CleanM: An Optimizable Query
Language for Unified Scale-Out Data Cleaning
Stella Giannakopoulou
Manos Karpathiotakis Benjamin
Gaidioz
Anastasia Ailamaki
Knowledge Verification for LongTail
Verticals - research Furong Li
Xin Luna Dong
Anno Langen
Yang Li
(Social) Graphs and streams
A Declarative Query Processing System for Nowcasting
Dolan Antenucci Michael
Anderson Michael Cafarella
When
Engagement Meets Similarity: Efficient (k,r)-Core
Computation on
Social Networks
Fan
Zhang Ying Zhang Lu Qin
Wenjie Zhang Xuemin Lin
READS: A
Random Walk Approach for Efficient and Accurate Dynamic SimRank
Minhao Jiang Ada Wai Chee
Fu Raymond Chi-Wing Wong Ke Wang
Revisiting the Stop-and-Stare Algorithms for
Influence Maximization
Keke Huang Sibo Wang Glenn Bevilacqua Xiaokui Xiao
Laks Lakshmanan
Approximate Query Processing
Revisiting Reuse for Approximate Query
Processing
Alex Galakatos Andrew Crotty Emanuel Zgraggen Carsten
Binnig
Tim Kraska
Probabilistic Database Summarization for
Interactive Data Exploration
Laurel
Orr Dan Suciu Magdalena Balazinska
Data
Driven Approximation with Bounded Resources
Yang Cao
Wenfei Fan
Crowdsourcing
Understanding Workers, Developing Effective
Tasks, and Enhancing
Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace
Ayush Jain Akash Das Sarma Aditya Parameswaran Jennifer Widom
Truth
Inference in Crowdsourcing: Is the Problem Solved?
Yudian Zheng Guoliang Li Yuanbing Li Caihua Shan Reynold Cheng
A Data
Quality Metric (DQM): How to Estimate the Number of Undetected
Errors in Data Sets
Yeounoh Chung
Sanjay Krishnan Tim Kraska
Visualization and data exploration
Effortless Data Exploration with zenvisage: An Expressive and
Interactive Visual Analytics System
Tarique Ashraf Siddiqui Albert
Kim John Lee Karrie
Karahalios Aditya Parameswaran
Question answering