Issue #12 โ Sora-Level Open Models & New VBench Records
Published 2026-03-03 ยท 6 papers ยท 3 generation ยท 3 editing
OpenSora v2: Scaling Video Diffusion to 1080p with 3D-VAE
Introduces a 3D variational autoencoder that enables 1080p video generation at 24fps with significantly reduced memory cost.
| Metric | Score | ฮ |
|---|---|---|
| VBench Total | 84.3 | +2.1 |
| FVD (UCF-101) | 198 | -47 |
| CLIPSIM | 0.312 | +0.008 |
| Temporal Consistency | 97.1 | +1.4 |
VideoEdit-XL: Instruction-Based Video Editing with Flow Matching
A flow-matching approach for instruction-guided video editing that preserves temporal coherence across 128-frame clips.
| Metric | Score | ฮ |
|---|---|---|
| Edit Accuracy | 91.2% | +3.8% |
| Temporal Consistency | 95.6 | +2.3 |
| CLIPSIM | 0.298 | |
| FVD | 312 | -28 |
CogVideoX-5B: Scaling Expert Mixture for Long Video Generation
Mixture-of-experts architecture generating coherent 60-second videos with consistent characters and scene transitions.
| Metric | Score | ฮ |
|---|---|---|
| VBench Total | 83.7 | +1.9 |
| FVD (UCF-101) | 211 | -35 |
| Subject Consistency | 96.8 | +2.1 |
| CLIPSIM | 0.307 | +0.005 |
StableVideo 3.0: Diffusion Transformers for 4K Video Inpainting
DiT-based inpainting model that handles 4K resolution video with spatiotemporal attention for seamless object removal.
| Metric | Score | ฮ |
|---|---|---|
| Inpainting PSNR | 34.2dB | +1.8dB |
| LPIPS | 0.042 | -0.011 |
| Temporal Flicker | 0.98 | +0.03 |
MotionForge: Physics-Aware Video Generation with World Models
Combines a learned world model with diffusion generation to produce physically plausible motion and object interactions.
| Metric | Score | ฮ |
|---|---|---|
| VBench Physics | 78.9 | +5.2 |
| FVD (Kinetics) | 267 | -41 |
| Motion Realism | 88.4 | +3.7 |
VidStyler: Zero-Shot Video Style Transfer via Adaptive LoRA
Applies artistic styles to video with zero-shot generalization using adaptive low-rank adaptation modules per frame.
| Metric | Score | ฮ |
|---|---|---|
| Style Accuracy | 87.3% | +4.1% |
| Temporal Consistency | 94.2 | +1.9 |
| CLIPSIM | 0.285 |
Want full archive access?
Upgrade to Pro for $5/month and get every past issue plus early Tuesday delivery.
See pricing โ