Siebel School Speaker Series Master Calendar

Machine Learning Seminar: Kumar Kshitij Patel, "When do Score-based Data Valuation Methods Work and Why?"

Mar 27, 2026   2:00 - 3:15 pm  
1214 Siebel Center
Sponsor
Research Area of Artificial Intelligence
Speaker
Kumar Kshitij Patel
Contact
Allison Mette
E-Mail
agk@illinois.edu
Originating Calendar
Siebel School Speakers Calendar
Abstract: Score-based valuation methods, such as Shapley-style scores and leave-one-out (LOO), are widely used for credit assignment in data markets, yet theory offers limited guidance on when and why they succeed. In this talk, we discuss recent work that studies these methods through the lens of best data subset selection for learning tasks. We show that even for monotone submodular valuation functions, LOO and Shapley-style scores cannot achieve a constant-factor approximation due to duplicate archetypes and collapsed pointwise credit. More broadly, boundary effects in canonical learning problems can induce supermodular spikes, ruling out constant-factor guarantees for any valuation method, including adaptive methods such as greedy selection. We identify two conditions that avert these failures: bounded curvature, which controls redundancy and restores guarantees for score-based methods, and coverage, which yields approximate submodularity over a sufficiently rich core for uniformly stable learning algorithms. We also examine the role of monotonicity and show separations between adaptive and non-adaptive methods for non-monotone valuations. Our results justify common practices such as deduplication while highlighting the importance of ensuring coverage before applying score-based selection.

The talk is based on work with Sai Praneeth Karimireddy, Raul Castro Fernandez, and Manolis Zampetakis. 

Bio: Kumar Kshitij Patel is a Postdoctoral Associate at the Yale Institute for Foundations of Data Science and a Research Fellow at the Simons Institute for Theory of Computing at the University of California, Berkeley. His research focuses on the foundations of collaborative and federated learning, with an emphasis on optimization, heterogeneity, privacy, and incentives. Kshitij received his PhD in 2025 from the Toyota Technological Institute at Chicago, where he worked with Prof. Nathan Srebro and Prof. Lingxiao Wang. He completed his undergraduate degree in computer science at the Indian Institute of Technology, Kanpur. He is a recipient of the IJCAI 2024 Distinguished Paper Award.
link for robots only