andrei
filatov
Main
Blog
Realtime Video Generation Calculator
First principles estimate — adjust parameters, get concrete numbers
—
per 1 second of video
—
Tokens
—
PFLOP/step
—
Realtime ratio
Model
Approach
Diffusion
AR Streaming
Parameters
1.3B
5B
14B
30B
72B
200B
Steps
4
1
25
50
VAE Compression
8×8 (Wan2.1)
16×16 (Wan2.2)
32×32 (future)
Sparse Attention
Off
2×
4×
6×
CFG
On (×2)
Off
Base: LongLive 1.3B
FP16 (20.7 FPS on H100)
FP8 (24.8 FPS on H100)
INT8 (mobile)
INT4 (mobile)
Hardware
Platform
Server GPU
Mobile NPU
GPU
H100 (2023)
B200 (2025)
B300 (H2 2025)
Rubin (2026)
Rubin Ultra (2027)
Feynman (2028)
Feynman Ultra (2029)
Next Gen (2030)
Precision
FP16
FP8
FP4 (Blackwell+)
GPU Count
1
1
4
8
NPU TOPS
80
20
750
1500
Target Video
Resolution
480p
720p
1080p
FPS
12
16
24