Prince Canuma
|
53bae534e7
|
fix LTX-2.3 audio
|
2026-03-15 02:06:35 +01:00 |
|
Prince Canuma
|
e0aafd72fc
|
Refactor generate.py to ensure temporal coordinates and position grids are processed in bfloat16 for consistency with PyTorch's precision behavior. Update denoise_dev_av function to apply standard ratio rescaling for audio and video guidance, enhancing numerical fidelity and model compatibility.
|
2026-03-12 21:26:38 +01:00 |
|
Prince Canuma
|
207c223354
|
Add LTX-2.3 model architecture with prompt-conditioned adaptive layer normalization (adaln) support. Introduce gating mechanisms in attention modules and update transformer configurations to accommodate new parameters. Refactor video and audio processing to utilize adaptive normalization, improving model flexibility and performance. Update weight loading and initialization logic to support dynamic block structures in the decoder.
|
2026-03-10 16:47:36 +01:00 |
|
Prince Canuma
|
ef76ec0921
|
add from pretrained
|
2026-01-23 18:13:51 +01:00 |
|
Prince Canuma
|
ce39e744c3
|
Refactor VideoEncoder to initialize from VideoEncoderModelConfig, enhancing configuration management. Add methods for weight sanitization and loading from pretrained models, improving model usability and integration with existing workflows.
|
2026-01-23 17:59:57 +01:00 |
|
Prince Canuma
|
02bfa228d9
|
Refactor weight loading and sanitization processes for audio models
|
2026-01-23 17:31:25 +01:00 |
|
Prince Canuma
|
f6e0e5d5a4
|
Update av_ca_timestep_scale_multiplier to 1000 in model configuration for consistency across modules
|
2026-01-16 15:59:22 +01:00 |
|
Prince Canuma
|
a658911f98
|
add audio
|
2026-01-16 01:15:22 +01:00 |
|
Prince Canuma
|
d1ca36a315
|
initial commit (LTX-2)
|
2026-01-11 23:48:33 +01:00 |
|