
Professor David W.L. Cheung

BSc CUHK; MSc, PhD Simon Fraser
Honorary Professor


Tel: (+852) 2859 7072
Fax: (+852) 2559 8447
Email: dcheung<at>cs.hku.hk
Homepage: http://www.cs.hku.hk/~dcheung

Professor David Wai-lok Cheung is an Honorary Professor of the Department of Computer Science. He is also Director of the Center for E-commerce Infrastructure Development (CECID). Professor Cheung graduated with a BSc in Mathematics from the Chinese University of Hong Kong, and received his MSc and PhD in Computer Science from Simon Fraser University in Canada in 1985 and 1988, respectively. Since joining HKU in 1994, he has been an active researcher in database, data mining and e-commerce technologies. His recent research covers secure computation on encrypted databases, data interoperability theory, and queries on community networks.

He has published in many leading venues including the SIGMOD, VLDB, ICDE and KDD conferences. He received the HKU Outstanding Researcher Award in 1999 and the Distinguished Contribution Award at the 2009 Pacific-Asia Knowledge Discovery and Data Mining Conference. His other contributions to academia include leadership roles in various prominent conferences: he was the program chairman of the 2001 and 2005 Pacific-Asia Knowledge Discovery and Data Mining Conferences (PAKDD); the conference chairman of the 2007 PAKDD Conference; the Conference Co-Chair of the 2009 ACM Conference on Information and Knowledge Management (CIKM); the program vice-chair of ICDM 2006; and the program chair of HKICC 2003. He has also served as a program committee member for numerous other international conferences. Some of Professor Cheung's representative publications can be found on DBLP, Google Scholar and CiteSeer.

In applied research, under his directorship, CECID has received many prestigious grants from the Innovation and Technology Commission (ITC) of the Hong Kong SAR Government, in a total amount of HK$60M, from 2001 to 2009. Together with his team, Prof. Cheung has developed an open-source ebXML gateway used by developers from more than 80 countries. This open-source product received prominent awards in the Hong Kong Computer Society's (HKCS) 2004 IT Excellence Awards competition, the 2004 Asia-Pacific ICT Awards competition, and the 2005 Linux Business Awards competition.

Professor Cheung has provided consultancy services to many companies. In 2003, the Information Technology Services Department (now the Office of the Government Chief Information Officer) commissioned CECID to develop the XML Schema Design and Management Guide, which is now followed by all bureaux and departments in their joined-up service projects.

In public service, Prof. Cheung is currently a member of the RGC Engineering Panel as well as a board member of the Hong Kong Deposit Protection Scheme. He is also the Certification Board Chairman of the Hong Kong IT Professional Certification. He held part-time memberships at the Central Policy Unit and was a member of the Pacific Economic Cooperation Council.

Research Interests

Data engineering, data mining, outsourcing data mining, data integration, interoperability theory, semantic search, e-commerce technology, and SOA

Selected Publications

  • W. K. Wong, David W. Cheung, E. Hung, B. Kao, and Nikos Mamoulis, An Audit Environment for Outsourcing of Frequent Itemset Mining, (to appear, PVLDB, Vol. 2, issue 1, 2009), International Conference on Very Large Data Bases (VLDB 2009), Lyon, France, Sept. 2009.
  • W. K. Wong, David W. Cheung, Ben Kao, and Nikos Mamoulis, Secure k-NN Computation on Encrypted Databases, Proc. The 28th ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), Providence, Rhode Island, USA, June 2009.
  • Shiming Zhang, Nikos Mamoulis and David W. Cheung, Scalable Skyline Computation Using Object-based Space Partitioning, Proc. The 28th ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), Providence, Rhode Island, USA, June 2009.
  • Ben Kao, Sau Dan Lee, David W. Cheung, Wai-Shing Ho, K.F. Chan, Clustering Uncertain Data Using Voronoi Diagrams, Proc. IEEE International Conference on Data Mining (ICDM 2008), Pisa, Italy, December 2008.
  • Bin Jiang, Jian Pei, Xuemin Lin, David W. Cheung, Jiawei Han, Mining Preferences from Superior and Inferior Examples, Proc. The 14th ACM SIGKDD conference (KDD 2008), Las Vegas, USA, August, 2008.
  • Eric Lo, Ben Kao, S.D. Lee, W.S. Ho, Chun-Kit Chui, and David W. Cheung, OLAP on Sequence Data, Proc. The 27th ACM SIGMOD International Conference on Management of Data (SIGMOD 2008), Vancouver, Canada, June 2008
  • N. Mamoulis, M.L. Yiu, K.H. Cheng, David W. Cheung, Efficient Top-k Aggregation of Ranked Inputs, ACM Transactions on Database Systems, Association for Computing Machinery, 32(3), August 2007
  • Wai Kit Wong, David W. Cheung, Edward Hung, Ben Kao, and Nikos Mamoulis, Security in Outsourcing of Association Rule Mining, Proc. The 33rd International Conference on Very Large Data Bases (VLDB 2007), Vienna, Austria, Sept. 2007
  • Minghua Zhang, Ben Kao, David W. Cheung and Kevin Y. Yip, Mining Periodic Patterns with Gap Requirement from Sequences, ACM Transactions on Knowledge Discovery from Data, Association for Computing Machinery, V1, I2, August 2007
  • Huiping Cao, Nikos Mamoulis, and David W. Cheung, Discovery of Periodic Patterns in Spatiotemporal Sequences, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, 19(4): 453-467, April 2007.
  • Lin Cheung, Kevin Y. Yip, David W. Cheung, Ben Kao, Michael Ng, On Mining Micro-array data by Order-Preserving Submatrix, International Journal of Bioinformatics Research and Applications, Inderscience Publishers, V3, I1, pages 42-64, 2007
  • Nikos Mamoulis, K.H. Cheng, M.L. Yiu, and David W. Cheung, Efficient Aggregation of Ranked Inputs, Proc. The 22nd International Conference on Data Engineering (ICDE 2006), Atlanta, GA, April 2006.
  • Wang Lian, Nikos Mamoulis, David W. Cheung, and Sui Ming Yiu, Indexing Useful Structural Patterns for XML Query Processing, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, V17, N17, July 2005
  • Minghua Zhang, Ben Kao, David W. Cheung and Kevin Yip, Mining Periodic Patterns with Gap Requirement from Sequences, Proc. The 24th ACM SIGMOD International Conference on Management of Data (SIGMOD 2005), Baltimore, Maryland, June 2005.
  • Kevin Y. Yip, David W. Cheung and Michael K. Ng, On Discovery of Extremely Low-Dimensional Clusters using Semi-Supervised Projected Clustering, Proc. The 21st IEEE International Conference on Data Engineering (ICDE 2005), Tokyo, April 2005
  • S. I. Ao, Kevin Yip, Michael Ng, David W. Cheung, Pui-Yee Fong, Ian Melhado and Pak C Sham, CLUSTAG: Hierarchical Clustering and Graph Methods for Selecting Tag SNPs, Bioinformatics, Oxford Press, V21, I8, pp.1735-1736, April, 2005.
  • Minghua Zhang, Ben Kao, C.L. Yip, and David W. Cheung, Efficient Algorithms for Mining and Incremental Update of Maximal Frequent Sequences, Data Mining and Knowledge Discovery, Kluwer Academic Publishers, Springer, V10, N2, pp. 87-116, March 2005
  • Y.T. Shou, Nikos Mamoulis and David W. Cheung, Fast and exact warping of time series using adaptive segmental approximations, Machine Learning, Kluwer Academic Publishers, 58(2-3), pp. 231-267, February, 2005
  • Kevin Y. Yip, David W. Cheung and Michael K. Ng, HARP: A Practical Projected Clustering Algorithm, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, V16, N11, pp. 1387-1397, Nov. 2004.
  • Nikos Mamoulis, Huiping Cao, George Kollios, Marios Hadjieleftheriou, Yufei Tao, and David W. Cheung, Mining, Indexing, and Querying Historical Spatiotemporal Data, Proc. The Tenth ACM SIGKDD conference (SIGKDD 2004), Seattle, USA, August, 2004
  • Nikos Mamoulis, Xin Zhang, David W. Cheung and Yutao Shou, Fast Mining of Spatial Collocations, Proc. The Tenth ACM SIGKDD conference (SIGKDD 2004), Seattle, USA, August, 2004
  • W. Lian, D.W. Cheung, N. Mamoulis, and S.M. Yiu, An Efficient and Scalable Algorithm for Clustering XML Documents by Structure, Special Issue on Mining and Searching the Web, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, 16(1), January 2004
  • Nikos Mamoulis, David W. Cheung, and Lian Wang, The Signature Tree: An Index for Similarity Search in Categorical and Market-Basket Data, Proc. 19th IEEE International Conference on Data Engineering (ICDE-2003), Bangalore, India, March, 2003.
  • E. Hung and D.W. Cheung, Parallel Mining of Outliers in Large Database, Distributed and Parallel Databases, Kluwer Academic Publishers, 12: 5-26, 2002
  • D.W. Cheung, S.D. Lee, and Y. Xiao, Effect of Data Skewness and Workload Balance in Parallel Data Mining, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, 14(3):498-513, May 2002
  • D.W. Cheung, K. Hu, and S. Xia, An Adaptive Algorithm for Mining Association Rules on Shared-memory Multi-processors Parallel Machine, Distributed and Parallel Databases, Kluwer Academic Publishers, 9: 99-132, March 2001

Recent Research Grants

  • Innovation and Technology Fund (PI), An Intelligent Data Security Gateway for Multiparty Data Transfer Supported by Multi-Factor and Multi-Dimensional Data Protection Scheme (2008-2010). Amount Awarded: HK$ 6.5M
  • Innovation and Technology Fund (PI), Universal Web Service Adapter to Facilitate Software as a Service Implementation (2008-2010). Amount Awarded: HK$ 1M
  • Innovation and Technology Fund (PI), Multi-site Infrastructure for Massive Digital Content Collaboration (2008 - 2010), Amount Awarded: HK$ 3.9M
  • Innovation and Technology Fund (PI), An eLogistics Appliance with Data Exchange and Conversion Technologies for Infrastructure Connectivity (2007 - 2008), Amount Awarded: HK$ 6.6M
  • Innovation and Technology Fund (PI), Extending web 2.0 to deliver e-commerce services (2006 - 2007), Amount Awarded: HK$ 982,675
  • Innovation and Technology Fund (PI), Service oriented e-transaction platform (2006 - 2008), Amount Awarded: HK$ 9,198,850
  • RGC grant (PI): Semi-supervised Subspace Clustering for High-Dimensional Data (2005 - 2007). Amount awarded: HK$692,480.
  • Innovation and Technology Support Program, Innovation and Technology Fund (PI): A Business Process and Information Interoperability Platform Based on Open Standards (2003 - 2006). Amount awarded: $13,996,087. Total including industry sponsorship: $19,338,799.
  • RGC grant (PI): Projected Clustering for High Dimensional Data and Application in Gene Expression Data Mining (2003 - 2005). Amount awarded: $551,755.
  • RGC grant (PI): Applying Clustering Technique to Partition Large Collection of XML Documents for Fast Query Processing (2002 - 2004). Amount awarded: $433,404.
  • Innovation and Technology Support Program, Innovation and Technology Fund (PI): Establishment of an ebXML Software Infrastructure in Hong Kong (2002 - 2003). Amount awarded: $9,539,000. Total including industry sponsorship: $10,719,000.
  • RGC grant (PI): Applying Mining Techniques in Building Efficient OLAP System (1999 - 2001). Amount awarded: $786,000.
  • RGC grant (PI): Mining Association Rules on High Performance Parallel Systems (1998 - 2000). Amount awarded: $795,000.
  • RGC grant (PI): Discovery and Maintenance of Association Rules in Large Databases (1996 - 1998). Amount awarded: $416,000.
  • RGC grant (PI): New Induction Techniques for Knowledge Discovery in Databases (1995 - 1998). Amount awarded: $541,000.