Schedule for Seminar on Massive Data Sets
Please look frequently at the
special announcements.
If you would like copies of papers presented here, send
a request to
matias+seminar@math.tau.ac.il
Association rules and their generalizations
24/3/98
Fast Algorithms for Mining Association Rules
R. Agrawal and R. Srikant.
Proc. of the 20th Int'l Conference on Very Large Databases, Santiago,
Chile, Sept. 1994. Expanded version
available as IBM Research Report RJ9839, June 1994.
Lecturer: Rami Citrom
Dynamic Itemset Counting and Implication Rules for Market Basket Data
S. Brin, R. Motwani, J.D. Ullman, S. Tsur.
1997 ACM SIGMOD Conference on Management of Data, 1997, pp. 255-264.
Lecturer: Ranen Goren
31/3/98
Sampling large databases for association rules
H. Toivonen
Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), 1996.
Lecturer: Amitai Irron
The presentation is available in
ppt;
handouts are given in
doc.
Beyond Market Baskets: Generalizing
Association Rules to Correlations.
S. Brin, R. Motwani and C. Silverstein.
1997 ACM SIGMOD Conference on Management of Data, 1997, pp. 265-276.
Data Mining and Knowledge Discovery, 1998.
Lecturer: Micky Frankel
Time Sequences
7/4/98
Efficiently supporting AD Hoc Queries in
Large databases of Time sequences.
F. Korn, H.V. Jagadish, and C. Faloutsos,
Proc. SIGMOD, 1997.
Lecturer: Zipi Fligelman
Efficient Retrieval of Similar Time Sequences
Under Time Warping
B.-K. Yi, H.V. Jagadish, C. Faloutsos,
Proc. ICDE, 1998.
Lecturer: Zipi Fligelman
A summary is available
here.
14/4/98
Happy Passover!
Histograms
21/4/98
Improved Histograms for Selectivity
Estimation of Range Predicates.
V. Poosala, Y.E. Ioannidis, P.J. Haas and E.J. Shekita,
Proc. 1996 ACM SIGMOD Intl. Conf. Managment of Data, pages
294-305, 1996.
Lecturer: Ran Shaham
The presentation is available in
doc.
Fast Incremental Maintenance of
Approximate Histograms.
P.B. Gibbons, Y. Matias and V. Poosala,
Proceedings of the 23rd International Conference on Very Large
Databases (VLDB),
Athens, Greece, August 1997, pp. 466-475.
Lecturer: Uri Stav
Special Talk (note the unusual schedule)
22/4/98, 2:15-3:15
Compressing Finite Strings:
Optimal algorithms for non-asymptotic/non-probabilistic data
Guest Lecturer: Dr. S. Cenk Sahinalp
Histograms (cont.)
28/4/98
Approximate Order Statistics in One Pass and with Limited Memory
Sridhar Rajagopalan, Gurmeet Singh Manku, and Bruce Lindsay
Proc. ACM SIGMOD, 1998 (to appear).
Lecturer: Gil Arditi
The presentation is available in
doc.
5/5/98
Wavelet-Based Histograms for
Selectivity Estimation.
Y. Matias, J. S. Vitter, and M. Wang.
Proc. of the 1998 ACM SIGMOD International Conference on
Management of Data (SIGMOD '98), Seattle, Washington, June 1998 (to appear).
Lecturer: Ran Adler
Clustering techniques
12/5/98
BIRCH: an Efficient Data Clustering
Method for Very Large Databases.
T. Zhang, R. Ramakrishnan and M. Livny,
Proc. 1996 SIGMOD, pp. 103-114, 1996.
See also the BIRCH
project home page
Lecturer: Boaz Shaham
The presentation is available in
ppt.
CURE: An Efficient Clustering Algorithm for Large Databases.
S. Guha, R. Rastogi and K. Shim.
Proc. of the ACM SIGMOD Conference, 1998.
Lecturer: Yakov Zakai
The presentation is available in
doc.
Index Trees
19/5/98
Generalized Search Trees for Database Systems.
J.M. Hellerstein, J.F. Naughton, and A. Pfeffer.
Proc. 21st International Conference on Very Large Data Bases
(VLDB), Zurich, September 1995.
See also the GIST
project home page
Lecturer: Assaf Almaz
The speaker will also present indexing problems and solutions
from (real-life) products related to massive document processing.
The presentation is available in
doc.
Parallel and External Memory Algorithms
26/5/98
Asynchronous Parallel Algorithms for Mining Association Rules on a
Shared-memory Multi-processors
D.W. Cheung, K. Hu, and S. Xia.
Proc. SPAA, 1998.
Proc. 10th Annual ACM Symposium on Parallel Algorithms and
Architectures (SPAA '98), (to appear) June 1998
Lecturer: Saar Cohen
Simple Randomized Mergesort on Parallel Disks
R. Barve, E. F. Grove and J. S. Vitter.
Proc. 8th Annual ACM Symposium on Parallel Algorithms and
Architectures (SPAA '96), Padua, Italy, June 1996, 109-118.
Lecturer: Ido Safruti
The presentation is available in
doc.
2/6/98
No Seminar.
9/6/98
High-Performance Sorting on Networks of Workstations
A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, D. E. Culler,
J. M. Hellerstein, D. A. Patterson.
Proc. of the ACM SIGMOD Conference, 1997.
Lecturer: Nadav Grossaug
Return to seminar home page
For requests or corrections contact
matias+seminar@math.tau.ac.il
Last updated June 7, 1998