- about
- docs
- ideas
- more
- now
-
posts
- 2025
- 2026
- ai-for-physics
- AuroraGPT
- dope-slides
- drafts
- ezpz-at-alcf
- ezpz-v1
- jupyter
- resume
- svgbob
- torchtune-aurora
- … 3 more
- projects
- showcase
-
talks
- 2025
- 2026
- ai-for-science-2024
- alcf-hpc-workshop-2024
- aurora-gpt-fm-for-electric-grid
- AuroraGPT
- AuroraGPT-SIAM25
- demo-slides
- hpc-user-forum
- incite-hackathon-2025
- llms-at-scale
- llms-on-polaris
- … 2 more
-
webtui
- components
- contributing
- installation
- plugins
- start
- index.mdx
- _redirects
- index.astro
- index.xml
- listings.json
- robots.txt
- search.json
- sitemap.xml
Sam Foreman
Computational Scientist · Argonne National LaboratoryI'm a Computational Scientist in the AI / ML Group at the Argonne Leadership Computing Facility.
My work focuses on large-scale distributed training of foundation models for scientific applications. I co-lead the Models & Pre-Training team for the AuroraGPT project.
Recent Posts
Latest writing and notes
| Title | Date |
|---|---|
| Running 50k Python Processes on Aurora with ezpz yeet How `ezpz yeet` distributes Python environments to every worker node in an HPC job, and how it scales from 8 to 4096 nodes on Aurora. | 05/26 |
| Pre-Training AuroraGPT with TorchTitan Pre-training AuroraGPT with TorchTitan and ezpz: Last Two Weeks (Apr 12–27, 2026) | 04/26 |
| ⏱️ Comparing Launchers on Aurora Benchmarking and comparing the performance of different launchers on Aurora at ALCF: `torchrun` vs. `ezpz launch` | 02/26 |
| 🍋 ezpz: distributed PyTorch across any hardware A history and overview of `ezpz`, with AMD and Intel PyTorch enablement timelines and why portable distributed training across GPU vendors is finally possible. | 01/26 |
| 🎉 Happy New Year! A New Year update summarizing ongoing projects including AuroraGPT, AERIS, and other involvements at Argonne. | 01/26 |
Recent Talks
Presentations and workshops