Computer Science Speaker Series Master Calendar

View Full Calendar

COLLOQUIUM: Pan Li, "Challenges and Opportunities in Graph Machine Learning: A Study on Graph Data Distribution Shifts"

Event Type
Seminar/Symposium
Sponsor
Illinois Computer Science
Location
HYBRID: 2405 Siebel Center for Computer Science or online
Virtual
wifi event
Date
Apr 22, 2024   3:30 pm  
Views
167
Originating Calendar
Computer Science Colloquium Series

Zoom: https://illinois.zoom.us/j/84515700484?pwd=T3FTUE9TN1MzdE56R2lYZ2ROTHMwdz09

Refreshments Provided.

Abstract: 

Graphs have been widely applied to model intricate relationships among entities. The application of Graph Machine Learning (GML) to enhance prediction capabilities for graph-structured data is prevalent in several scientific disciplines, such as particle physics, material science, and biology. However, applications in these domains often present changes in data distributions due to the label-collection process they employ. Specifically, the data used for model training often comes from thoroughly investigated regimes, whose distributions often do not align well with the under-explored regime of scientific interest. Furthermore, the interconnected nature of entities in a graph presents an additional level of complexity, making current distributionally robust methods suboptimal when being applied to graph data. 

This presentation will focus on our recent studies on GML under distribution shifts. Our studies are motivated by the observation of the data distribution shift between simulation and real experiments in particle physics. We propose a method named graph structure alignment. The key idea of our approach is to estimate and quantify shifts in entity connection patterns from the training phase to real-world evaluation. Consequently, the influence of neighboring entities on a central node can be appropriately calibrated based on prior estimations, serving to mitigate the distribution shift in graph data.

Bio:
Pan Li is an assistant professor at Georgia Tech. ECE department and holds an adjunct position at Purdue CS department. Pan's research interest lies broadly in the area of machine learning and optimization with graph data. His recent work includes algorithms and analysis for graph neural networks, hypergraph spectral methods, and their applications in physics and circuit design. Pan Li's work has been recognized by several awards including NSF Early Career Award, several industry research awards, and the Best Paper award at the Learning on Graph conference 2022.
 

Part of the Illinois Computer Science Speakers Series. Faculty Host: Han Zhao


Meeting ID: 845 1570 0484 
Passcode: csillinois


If accommodation is required, please email <erink@illinois.edu> or <communications@cs.illinois.edu>. Someone from our staff will contact you to discuss your specific needs



 

 

link for robots only