OCR Master Calendar - Select Centers + Engineering

Directions in Deep Learning Hardware

Feb 19, 2026   4:00 - 5:00 pm  
1002 Grainger Auditorium
Sponsor
Varun Chandrasekaran, Ph.D.
Speaker
William Dally, Ph.D. - Stanford University
Contact
Varun Chandrasekaran, Ph.D.
E-Mail
varunc@illinois.edu
Views
68
Originating Calendar
Illinois ECE Distinguished Colloquium Series

Abstract:

The current resurgence of artificial intelligence, including generative AI like ChatGPT, is due to advances in deep learning. Systems based on deep learning are transforming all aspects of life. Deep learning has been enabled by powerful, efficient computing hardware. The algorithms used have been around since the 1980s, but it has only been in the last decade - when powerful GPUs became available to train networks - that the technology has become practical.   Advances in DL are now gated by hardware performance.   Demand for training operations has increased 10 Million fold  in the last decade and is currently growing by 16x per year.  The autoregressive nature of the decode phase of LLM inference places particularly heavy demands on latency and memory bandwidth.  The trend toward agentic systems places additional demands on latency.  This talk will review the history of hardware for deep learning and how this hardware is adapting to the latest challenges.

Bio:

Bill is Chief Scientist and Senior Vice President of Research at NVIDIA Corporation and an Adjunct Professor and former chair of Computer Science at Stanford University. Bill is currently working on developing hardware and software to accelerate demanding applications including machine learning, bioinformatics, and logical inference.  He has a history of designing innovative and efficient experimental computing systems.  While at Bell Labs Bill contributed to the BELLMAC32 microprocessor and designed the MARS hardware accelerator. At Caltech he designed the MOSSIM Simulation Engine and the Torus Routing Chip which pioneered wormhole routing and virtual-channel flow control. At the Massachusetts Institute of Technology his group built the J-Machine and the M-Machine, experimental parallel computer systems that pioneered the separation of mechanisms from programming models and demonstrated very low overhead synchronization and communication mechanisms.  At Stanford University his group developed the Imagine processor, which introduced the concepts of stream processing and partitioned register organizations, the Merrimac supercomputer, which led to GPU computing, and the ELM low-power processor.  Bill is the 2025 recipient of the Benjamin Franklin Medal for Computing and Cognitive Science and has been named one of the recipients of the 2025 Queen Elizabeth Prize for Engineering.   Bill is a Member of the National Academy of Engineering, a Fellow of the IEEE, a Fellow of the ACM, and a Fellow of the American Academy of Arts and Sciences.  He has received the ACM Eckert-Mauchly Award, the IEEE Seymour Cray Award, the ACM Maurice Wilkes award, the IEEE-CS Charles Babbage Award, the IPSJ FUNAI Achievement Award, the Caltech Distinguished Alumni Award, and the Stanford Tau-Beta-Pi Teaching Award.  He was a member of President Biden’s council of advisors on science and technology.  He currently leads projects on computer architecture, network architecture, circuit design, and programming systems. He has published over 250 papers in these areas, holds over 160 issued patents, and is an author of four textbooks.

 

 https://research.nvidia.com/person/william-dally


link for robots only