HPDM 2008



Past Editions

Bethesda, HPDM'06
Newport Beach, HPDM'05
Lake Buena Vista, HPDM'04
San Francisco, HPDM'03
Washington, HPDM'02
San Francisco, PDDM'01
Cancun, HPDM'00
San Juan, HPDM'99
Orlando, HPDM'98

Workshop History

After several editions linked with the International Parallel & Distributed Processing Sym-posium (1998-2001), and by the SIAM Data Mining conference (2002-2006), this year HPDM is organized in conjunction with the IEEE International Conference on Data Mining.
The upcoming theme of the workshop is support and ex-ploitation of emerging computer architectures such as multi-core CPUs and streaming GPUs. We are looking forward to a very special and exciting 10th edition in Pisa!

Past Proceedings

IPDPS Workshops 2001
IPDPS Workshops 2000
IPDPS Workshops 1999
IPDPS Woskshops 1998


10th International Workshop on
High Performance Data Mining
(in conjunction with ICDM)
Pisa, December 15th 2008

Check the workshop progam here.

Call For Papers

The HPDM workshop focuses on the issues of high performance, distributed and parallel computation in knowledge discovery and data mining. Theoretical advances, algorithm and systems, as well as application cases are welcome contributions. The workshop will take into consideration also relevant work-in-progress papers.

Over the years the definition of high performance computing evolved according to the opportunities provided by new technologies and to the needs of emerging industrial and scientific applications. Nowadays, high performance data mining includes the quest for fast algorithms that target performance by overcoming memory and I/O bottlenecks, for mining techniques that work in grid and cluster computing frameworks, and for solutions that exploit distributed and mobile computation. Innovative computing platforms, and solutions designed to cope with complex memory hierarchies can help meet the severe performance requirements in managing huge amounts of data.

This year the workshop aims at organizing a special theme session exploring emerging trends in computer architectures. We seek advances in exploiting multicore CPUs and GPUs in mining algorithms and techniques.

The workshop welcomes papers (including works in progress) on all aspects of high performance data mining, including, but not limited to:

  • Data mining exploiting multi-core CPUs and/or GPUs
  • Disk-based and memory-hierarchy aware algorithms
  • High performance data-stream mining and management
  • Mining of streaming multimedia content
  • Efficient, scalable, parallel/distributed solutions
  • Parallel and distributed KDD frameworks and systems
  • Distributed and peer-to-peer data mining techniques
  • Distributed techniques for incremental, exploratory and interactive mining
  • Mining techniques for network management
  • Data mining in mobile environments
  • Scalable techniques for mining social networks
  • Theoretical foundations for resource-aware mining in mobile, streaming, distributed environments
  • Applications of parallel and distributed data mining in business, science, engineering, medicine, and other disciplines

Workshop Proceedings

All accepted papers will be included in the ICDM '08 Workshop Proceedings volume published by the IEEE Computer Society Press. We are considering whether to organize a journal special issue embracing the topics of the workshop. Extended versions of selected papers will be invited to submit.

Important Dates

  • Submissione deadline extended to August 7, 2008
  • September 23, 2008 : Notification of paper acceptance
  • October 7, 2008 : Camera-ready of accepted papers

Please check the Author's Final Formatting and Submission Instructions webpage.

Papers Submission

The Submission Form to submit your work is available here. Paper submissions should be limited to a maximum of 10 pages in the IEEE 2-column format, (see the IEEE Computer Society Press Proceedings Author Guidelines).
At least one author per paper will have to register at ICDM Conference and present the paper at the workshop. There is no separate workshop fee.

Invited Talk

The keynote talk was given by Prof. Srinivasan Parthasarathy, check details here.

Accepted Papers

  1. Marco Lackovic, Domenico Talia, and Paolo Trunfio, “Service Oriented KDD: A Framework for Grid Data Mining Workflows” slides
  2. Alan Ratner, “Mining Unstructured Text at Gigabyte per Second Speeds” slides
  3. Eugenio Cesario and Domenico Talia, “Distributed Data Mining Models as Services on the Grid” slides
  4. Christoph Weyerhaeuser, Tobias Mindnich, Franz Faerber, and Wolfgang Lehner, “Exploiting Graphic Card Processor Technology to Accelerate Data Mining Queries in SAP NetWeaver BIA” slides
  5. Haimonti Dutta and Hillol Kargupta, “Distributed Linear Programming and Resource Management for Data Mining in Distributed Environments” slides
  6. Gabor Takacs, Istvan Pilaszy, Bottyan Nemeth, and Domonkos Tikk, “Investigation of Various Matrix Factorization Methods for Large Recommender Systems” slides
  7. Pitchaimalai et al. “Efficient Distance Computation Using SQL Queries and UDFs”
  8. Jie Ouyang et. al “Chi-Square Test Based Decision Trees Induction in Distributed Environment”
  9. Wang et al. “Parallel Hierarchical Clustering on Market Basket Data”
  10. Ranganath and Narasimha “Stream-Close: Fast mining of Closed Frequent Itemsets in high speed data streams”

Workshop Co-Chairs

  • Greg Buehrer, Microsoft Live Labs
  • Massimo Coppola, ISTI-CNR
  • Claudio Lucchese, ISTI-CNR

Steering Committee

  • Hillol Kargupta, University of Maryland, Baltimore County
  • Vipin Kumar, University of Minnesota
  • Srinivasan Parthasarathy, Ohio State University
  • David Skillicorn, Queens University
  • Mohammed Zaki, RPI

Program Committee

  • Francesco Bonchi, Yahoo! Research Barcelona
  • Toon Calders, Eindhoven Technical University
  • Yen-Kuang Chen, Intel Research
  • Amol Ghoting, IBM TJ Watson
  • Aristides Gionis, Yahoo! Research Barcelona
  • Hillol Kargupta, University of Maryland, Baltimore County
  • Aleksander Kolcz, Microsoft Live Labs
  • Vipin Kumar, University of Minnesota
  • Anthony Nguyen, Intel Research
  • Salvatore Orlando, University of Venice
  • Srinivasan Parthasarathy, Ohio State University
  • Raffaele Perego, ISTI-CNR, Italy
  • Fabrizio Petrini, IBM TJ Watson
  • Christophe Rigotti, INSA Lyon
  • David Skillicorn, Queens University
  • Domenico Talia, University of Calabria
  • Philip S. Yu, University of Illinois
  • Mohammed Zaki, RPI




HPDM 08 is sponsored by

Keynote talk abstract is on-line!
Workshop Program is on-line!
List of accepted papers is now available.
Notifications have been sent. Camera-ready are due on October 7th
Notifications will be sent on September 22nd
Aug 01: Deedline is extended to August 07
June 08: Submissions are open
May 27: The web site is now

Important dates
Aug. 7, 2008: Submission
Sept. 23, 2008: Notification
Oct. 7, 2008: Camera-ready

Submission Site
Submission Form

ICDM 2008
IEEE International Conference
on Data Mining