Jack Min Ong

The future is still unknown.
Build anyway.

Writing, tools, visual explainers, and life chapters from the edge between machine learning infrastructure and becoming more alive.

Read latest -> Explore apps ->

About

Jack Min Ong, oversized curiosity.

This site is shifting from a traditional personal page into a branded home for ideas: a place for essays, utility apps, diagrams, and the mythology that makes the work feel alive.

Portrait slot

Hey, lorem ipsum...

Builder of small systems, strange maps, and useful explanations.

Placeholder intro for the next pass. It can mention Kuala Lumpur, San Francisco, Prime Intellect, GPUs, family, friends, taste, and the deeper thread underneath all the building.

SF / KL ML systems Visual notes

Placeholder quote from someone who has seen the drafts before the launch.

Friend, future testimonial

Placeholder quote about taste, velocity, and making confusing systems feel usable.

Collaborator, future testimonial

Placeholder quote for the person who knows the Malaysia to SF arc up close.

Friend, future testimonial

Writing

Notes from the frontier, sorted like a small newsroom.

All posts ->

gpu 2026-03-18

Fused MoE + LoRA: Fitting Heterogeneous Adapters in a Single Forward Pass

Designing a fused kernel that applies per-expert LoRA adapters inside the MoE dispatch/combine cycle.

gpu 2026-02-24

Notes on Blackwell SM100: What Actually Changed for Kernel Writers

A practical breakdown of what SM100 means if you're writing CUDA kernels — TCGEN05, UMMA, TMEM, and what you can ignore.

infra 2026-01-11

KV Cache Recompute vs. Offload — When Does Each Win?

Benchmarks and intuition for choosing between KV cache recompute and CPU offload in LLM serving.

ml 2025-11-30

Shipping INTELLECT-3: Lessons from Async RL on 512 H200s

What we learned training INTELLECT-3 with prime-rl's disaggregated async RL architecture.

Mini Apps

Small tools for planning, charts, and explainers.

World clocks ->

Live

World Clocks

Find meeting overlap across SF, New York, Europe, Malaysia, and whatever timezone gets added next.

--:--

Placeholder

Daily Planning

A tiny operating board for today, this week, and the thing that matters most.

Placeholder

Chart Studio

Quick visual sketches for throughput, money, habits, and systems that need a shape.

Placeholder

GPU Explainers

Little interactive diagrams for kernels, memory, inference, and cluster behavior.

Placeholder

Field Notes Index

A future shelf for daily logs, public scratchpads, and visual breadcrumbs.

Placeholder

System Pulse

A small dashboard for live-ish status, rituals, or the machines that keep the site moving.

vLLM cluster

K8s nodes

NVLink mesh

RDMA fabric

Uptime: 0d 0h 0m

Mem: -- / 141 GB HBM3e

Lore

Manhwa chapters for the arc underneath the work.

The polished version can become illustrated panels. For now, these are chapter placeholders: a loose timeline with room for mystery.

2024

Roots and Rooms

Family, friends, and the first version of a life that feels designed on purpose.

Prime Labs

Training systems, GPU clusters, and the strange joy of making throughput visible.

Curiosity Engine

A chapter for questions that refuse to stay tidy.

Unknown Future

The page where the map ends and the work keeps going.

Long Arc

A placeholder for the far chapter, still private, still being earned.

The future is still unknown.Build anyway.

Jack Min Ong, oversized curiosity.

Builder of small systems, strange maps, and useful explanations.

Notes from the frontier, sorted like a small newsroom.

Fused MoE + LoRA: Fitting Heterogeneous Adapters in a Single Forward Pass

Notes on Blackwell SM100: What Actually Changed for Kernel Writers

KV Cache Recompute vs. Offload — When Does Each Win?

Shipping INTELLECT-3: Lessons from Async RL on 512 H200s

Small tools for planning, charts, and explainers.

World Clocks

Daily Planning

Chart Studio

GPU Explainers

Field Notes Index

System Pulse

Manhwa chapters for the arc underneath the work.

Roots and Rooms

Prime Labs

Curiosity Engine

Unknown Future

Long Arc

The future is still unknown.
Build anyway.