Intro to AI Series: Introduction to Large Language Models (LLM)

Archit Vasan, ALCF
Student Training/Education Beginner
Vasan Session 4 Graphic

From October 1 - November 12, 2024, the ALCF will host a 7-part weekly virtual training series to teach undergraduates and graduates the fundamentals of using world-class supercomputers to advance the use of AI for research.
 

Intro to AI Series: Session 4

Trainees will learn how computer models generate and comprehend natural language. The session will cover the architecture of large language models, input tokenization, and practical applications.

Lecturer

Archit Vasan is a postdoctoral appointee in the Argonne Leadership Computing Facility with a background in computational biophysics. His research interests at ALCF involve the discovery of cancer drugs using machine Learning coupled to exascale computing. Archit received a BA in Physics and Mathematics from Austin College in 2016. He then received his PhD in Biophysics from the University of Illinois at Urbana-Champaign in 2023 under the guidance of Dr. Emad Tajkhorshid. 

AI for Science Talk Speaker

Nicola Ferrier is a senior computer scientist as part of the Mathematics and Computer Science division at Argonne National Laboratory. Ferrier's research interests are in the use of computer vision (digital images) to control robots, machinery, and devices, with applications as diverse as medical systems, manufacturing, and projects that facilitate ​“scientific discovery” (such as her recent project using machine vision and robotics for plant phenotype studies). She will be speaking on  AI @ Edge.