AI Dev 25 x NYC | Robert Crowe: JAX Made Simple: An Intuitive Guide to Building Fast Neural Networks


Description:

Robert Crowe, a product manager at Google with extensive AI experience, delivered a fast-paced technical overview of JAX at AI Dev 25 NYC, aimed at ML practitioners who already train models and want to understand where the framework fits in the modern deep learning stack. Crowe traces JAX’s origins to Google’s need for something more flexible and modular than TensorFlow—a system built for both massive-scale production training and rapid research iteration, now also used across scientific computing domains including bioinformatics and genomics.

The technical core of the talk covers JAX’s composable function transformations: `jit` for just-in-time compilation, `grad` for automatic differentiation, `vmap` for vectorization, and `shard_map` for fine-grained sharding, all compiled by the XLA layer into optimized machine code for GPUs and TPUs. Crowe then walks through the three main distributed training strategies: distributed data parallelism (DDP) for models that fit in a single accelerator’s memory, fully sharded data parallelism (FSDP) for models that don’t, and tensor parallelism for splitting individual layers when memory constraints are extreme. JAX’s SPMD (single program, multiple data) paradigm abstracts the hardware topology, making multi-device programs look like single-device programs; the sketches below illustrate both ideas.
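
To make the composition concrete, here is a minimal sketch (a toy squared-error loss of our own, not code from the talk) showing how `grad`, `vmap`, and `jit` nest freely:

```python
import jax
import jax.numpy as jnp

def loss(w, x, y):
    # Toy squared-error loss for a linear model (illustrative only).
    return jnp.mean((jnp.dot(x, w) - y) ** 2)

# grad: differentiate with respect to the first argument (the weights).
grad_loss = jax.grad(loss)

# vmap: vectorize the per-example gradient over a batch dimension.
per_example_grads = jax.vmap(grad_loss, in_axes=(None, 0, 0))

# jit: compile the whole composition with XLA for GPU/TPU execution.
fast_grads = jax.jit(per_example_grads)

w = jnp.ones(3)
x = jnp.ones((8, 3))
y = jnp.zeros(8)
print(fast_grads(w, x, y).shape)  # (8, 3): one gradient per example
```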
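
The SPMD idea can be sketched the same way: the function below is written as ordinary single-device code, and a sharding annotation (a hypothetical one-axis mesh named `data`, not from the talk) tells XLA to partition the batch across devices, which is data parallelism in miniature:

```python
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Hypothetical 1-D mesh over whatever accelerators are available.
mesh = Mesh(np.array(jax.devices()), axis_names=("data",))

# Shard the leading (batch) dimension across the "data" mesh axis.
x = jax.device_put(jnp.ones((32, 128)), NamedSharding(mesh, P("data", None)))

@jax.jit
def forward(x):
    # Written as if for one device; XLA partitions it across the mesh.
    return jnp.tanh(x @ jnp.ones((128, 64)))

print(forward(x).sharding)  # the output inherits the data-parallel layout
```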

A November 2023 scaling study showing near-ideal throughput scaling to over 50,000 TPUs provides concrete evidence of JAX’s production readiness. Crowe closes with an overview of the broader ecosystem: Flax for building neural networks, Optax for optimization, and other libraries, along with resource links for viewers who want to go deeper on any covered topic.
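
As a taste of those ecosystem pieces, here is a minimal sketch (toy model and training step of our own, not from the talk) pairing Flax for the model definition with Optax for the optimizer:

```python
import jax
import jax.numpy as jnp
import flax.linen as nn
import optax

class MLP(nn.Module):
    @nn.compact
    def __call__(self, x):
        x = nn.relu(nn.Dense(32)(x))
        return nn.Dense(1)(x)

model = MLP()
params = model.init(jax.random.PRNGKey(0), jnp.ones((1, 8)))

# Optax optimizers are pure pytree-to-pytree transformations.
tx = optax.adam(1e-3)
opt_state = tx.init(params)

def loss_fn(params, x, y):
    return jnp.mean((model.apply(params, x) - y) ** 2)

@jax.jit
def train_step(params, opt_state, x, y):
    grads = jax.grad(loss_fn)(params, x, y)
    updates, opt_state = tx.update(grads, opt_state)
    return optax.apply_updates(params, updates), opt_state
```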


📺 Source: DeepLearningAI · Published December 05, 2025
🏷️ Format: Deep Dive
