Seminar – Managing Information on the Web
Tova Milo, Winter 2007
Seminar Information
The seminar focuses on managing, analyzing, sharing, and integrating data
and applications across multiple sources, either on the Internet or at
enterprises. This topic has received much attention in the database, AI, Web,
IR and verification communities. We shall read recent papers in this area,
focusing on several specific issues, then explore possible future directions. A
list of tentative topics/papers is enclosed.
- Efficient Query
Evaluation on Probabilistic Databases, Nilesh Dalvi, Dan Suciu, VLDB
‘04
(Daniel 1/11)
- The
Dichotomy of Conjunctive Queries on Probabilistic Structures, Dalvi,
Suciu, PODS ‘07
- Learning
Probabilistic Relational Models , Nir
Friedman, Lise Getoor, Daphne Koller, Avi Pfeffer, IJCAI ‘99
- On
the Complexity of Managing Probabilistic XML Data, Pierre
Senellart and Serge Abiteboul, PODS ‘07
- Matching
Twigs in Probabilistic XML, Benny Kimelfeld, Yehoshua Sagiv, VLDB ‘07
- Optimal
aggregation algorithms for middleware, Fagin, Lotem, Naor, PODS ‘01
- Best
Position Algorithms for Top-K Queries, Akbarnia, Pacitti, Valduriez,
VLDB ’07.
- Efficient
Top-k Query Evaluation on Probabilistic Data, Christopher Re, Nilesh
Dalvi, Dan Suciu, ICDE ‘07 (Zvi 22/11)
- A
Bayesian Method for Guessing the Extreme Values in a Data Set, Wu,
Jermaine, VLDB ‘07
- Anytime
Measures for Top-K algorithms, Arai, Das, Gunopulos,
Koudas, VLDB ‘07 (Itay 20/12)
[ note that there is a class on 20/12 – see above in "top-k queries"]
- Novel
Data Mining Applications
1.
The Complexity of
Reasoning about Pattern-based XML Schemas, Gjergji Kasneci and
Thomas Schwentick, PODS ‘07
2.
Polynomial Time Fragments of XPath with Variables, Emmanuel Filiot, Joachim Niehren,
Jean-Marc Talbot and Sophie Tison, PODS ‘07