Commit Graph

11 Commits

Author SHA1 Message Date
Prince Canuma
6f6105b715 Add audio to video conditioning 2026-03-16 01:42:11 +01:00
Prince Canuma
cecd68197c fix tiling, rope precision and weights 2026-03-15 22:58:55 +01:00
Prince Canuma
53bae534e7 fix LTX-2.3 audio 2026-03-15 02:06:35 +01:00
Prince Canuma
e0aafd72fc Refactor generate.py to ensure temporal coordinates and position grids are processed in bfloat16 for consistency with PyTorch's precision behavior. Update denoise_dev_av function to apply standard ratio rescaling for audio and video guidance, enhancing numerical fidelity and model compatibility. 2026-03-12 21:26:38 +01:00
Prince Canuma
207c223354 Add LTX-2.3 model architecture with prompt-conditioned adaptive layer normalization (adaln) support. Introduce gating mechanisms in attention modules and update transformer configurations to accommodate new parameters. Refactor video and audio processing to utilize adaptive normalization, improving model flexibility and performance. Update weight loading and initialization logic to support dynamic block structures in the decoder. 2026-03-10 16:47:36 +01:00
Prince Canuma
ef76ec0921 add from pretrained 2026-01-23 18:13:51 +01:00
Prince Canuma
ce39e744c3 Refactor VideoEncoder to initialize from VideoEncoderModelConfig, enhancing configuration management. Add methods for weight sanitization and loading from pretrained models, improving model usability and integration with existing workflows. 2026-01-23 17:59:57 +01:00
Prince Canuma
02bfa228d9 Refactor weight loading and sanitization processes for audio models 2026-01-23 17:31:25 +01:00
Prince Canuma
f6e0e5d5a4 Update av_ca_timestep_scale_multiplier to 1000 in model configuration for consistency across modules 2026-01-16 15:59:22 +01:00
Prince Canuma
a658911f98 add audio 2026-01-16 01:15:22 +01:00
Prince Canuma
d1ca36a315 initial commit (LTX-2) 2026-01-11 23:48:33 +01:00