The School of Computing and Data Science (https://www.cds.hku.hk/) was established by the University of Hong Kong on 1 July 2024, comprising the Department of Computer Science and Department of Statistics and Actuarial Science and Department of AI and Data Science.

×

Warning

Sorry - this event cannot be viewed using the link provided. Its possible the item may have been updated while you have been viewing the webpage.
Events for
Upcoming Seminars and Events
December 18, 2025
  • Title: Towards Consistent and Physically Plausible Visual Generation

    Time: 11:00am 

    Venue: CB 308

    Speaker(s): Departmental Seminar by Prof. Jianfei Cai, Monash University

    Remark(s): 

    Abstract

    Recent advances in large language models (LLMs) and multimodal large language models (MLLMs) have significantly enhanced the understanding and encoding of textual information. Leveraging these capabilities, a growing number of diffusion-based generative models have emerged for text-conditioned visual generation — spanning text-to-image, text-to-video, and text-to-3D tasks. While these models offer remarkable flexibility and produce increasingly realistic content, they still face fundamental challenges: aligning precisely with user intent, maintaining spatial, view, and temporal consistency, and adhering to the laws of physics. In this talk, I will present several recent research projects from my group that attacks these challenges. PanFusion enforces global consistency in text-to-panorama image generation; MVSplat360 uses image conditions and explicit 3D representation to enhance view consistency of 3D generation. VLIPP integrates physics-informed priors to ensure physically plausible text-to-video generation. I will conclude by pointing out the limitations and discussing future directions such as developing world models.

    About the speaker

    Jianfei Cai is a Professor at Faculty of IT, Monash University, where he had served as the inaugural Head for the Data Science & AI Department. Before that, he was Head of Visual and Interactive Computing Division and Head of Computer Communications Division in Nanyang Technological University (NTU). His major research interests include computer vision, deep learning and multimedia. He is a co-recipient of paper awards in ACCV, ICCM, IEEE ICIP and MMSP, and a winner of Monash FIT’s Dean's Researcher of the Year Award and Monash FIT Dean's Award for Excellence in Graduate Research Supervision. He serves or has served as an Associate Editor for TPAMI, IJCV, IEEE T-IP, T-MM, and T-CSVT as well as serving as Senior/Area Chair for CVPR, ICCV, ECCV, ACM Multimedia, ICLR and IJCAI. He was the Chair of IEEE CAS VSPC-TC during 2016-2018. He had served as the leading TPC Chair for IEEE ICME 2012, the best paper award committee chair & co-chair for IEEE T-MM 2020 & 2019, and the leading General Chair for ACM Multimedia 2024. He is a Fellow of IEEE.

     

December 19, 2025
  • Title: Towards Scalable Serverless LLM Inference Systems

    Time: 10:30am 

    Venue: CB 308

    Speaker(s): Prof. Minchen Yu

    Remark(s): 

    Abstract

    Serverless computing has become a compelling cloud paradigm for model inference due to its high usability and elasticity. However, current serverless platforms suffer from significant cold-start overhead---especially for large models---limiting their ability to deliver low-latency, resource-efficient inference. In this talk, I will present three systems we built for scalable serverless inference. First, Torpor proposes node-level GPU pooling that enables fine-grained GPU sharing and fast model swapping. Second, LambdaScale leverages high-speed interconnects to scale models across nodes and performs pipelined inference for lower latency. Third, for emerging large mixture-of-experts (MoE) models, we design fine-grained expert scheduling with elastic scaling to improve the cost-effectiveness of MoE inference.

    About the speaker

    Minchen Yu is an Assistant Professor at the School of Data Science, The Chinese University of Hong Kong, Shenzhen. He received his Ph.D. from Hong Kong University of Science and Technology. His research interests cover cloud computing and distributed systems, with a recent focus on serverless computing and machine learning systems. His research has been published at various prestigious venues, such as NSDI, ATC, EuroSys, INFOCOM, and SoCC, and has been applied in leading cloud platforms, such as Alibaba Cloud. He received the Best Paper Runner-Up Award at IEEE ICDCS 2021.

  • Title: LLM based Zero Shot Speech Synthesis

    Time: 02:00pm 

    Venue: CB 308

    Speaker(s): Dr. Shujie Liu

    Remark(s): 

    Abstract

    With the rapid development of large language models (LLMs) in natural language processing, speech LLMs have also begun to receive increasing attention. In this talk, we will introduce VALL‑E, a zero‑shot text‑to‑speech (TTS) synthesis approach built upon large language models. Leveraging the in‑context learning capabilities of LLMs, VALL‑E can generate high‑quality, personalized speech using only a three‑second audio prompt from an unseen speaker. Building upon this foundation, we will further introduce several extensions of VALL‑E, including: VALL‑E X (the multilingual version), VALL‑E 2(addressing stability issues), PALLE(combining AR and NAR modelling), MELL‑E and FELLE (based on continuous speech representations).

    About the speaker

    Shujie Liu is a Principal Researcher at MSRA Hong Kong. His research focuses on natural language processing, speech processing, and machine learning. He has published over 100 papers in top-tier conferences and journals in NLP and speech, co‑authored the book Machine Translation, and contributed to Introduction to Artificial Intelligence. He has won multiple first‑place awards in international NLP and speech evaluation campaigns and has served as a reviewer and area chair for several major conferences. His research has been widely deployed in Microsoft products, including Microsoft Translator, Skype Translator, Microsoft IME, and Microsoft Speech Services.




Division of Computer Science,
School of Computing and Data Science

Rm 207 Chow Yei Ching Building
The University of Hong Kong
Pokfulam Road, Hong Kong
香港大學計算與數據科學院, 計算機科學系
香港薄扶林道香港大學周亦卿樓207室

Email: csenq@hku.hk
Telephone: 3917 3146

Copyright © School of Computing and Data Science, The University of Hong Kong. All rights reserved.
Don't have an account yet? Register Now!

Sign in to your account