mlx-video

Author	SHA1	Message	Date
Prince Canuma	6f6105b715	Add audio to video conditioning	2026-03-16 01:42:11 +01:00
Prince Canuma	cecd68197c	fix tiling, rope precision and weights	2026-03-15 22:58:55 +01:00
Prince Canuma	53bae534e7	fix LTX-2.3 audio	2026-03-15 02:06:35 +01:00
Prince Canuma	e0aafd72fc	Refactor generate.py to ensure temporal coordinates and position grids are processed in bfloat16 for consistency with PyTorch's precision behavior. Update denoise_dev_av function to apply standard ratio rescaling for audio and video guidance, enhancing numerical fidelity and model compatibility.	2026-03-12 21:26:38 +01:00
Prince Canuma	207c223354	Add LTX-2.3 model architecture with prompt-conditioned adaptive layer normalization (adaln) support. Introduce gating mechanisms in attention modules and update transformer configurations to accommodate new parameters. Refactor video and audio processing to utilize adaptive normalization, improving model flexibility and performance. Update weight loading and initialization logic to support dynamic block structures in the decoder.	2026-03-10 16:47:36 +01:00
Prince Canuma	ef76ec0921	add from pretrained	2026-01-23 18:13:51 +01:00
Prince Canuma	ce39e744c3	Refactor VideoEncoder to initialize from VideoEncoderModelConfig, enhancing configuration management. Add methods for weight sanitization and loading from pretrained models, improving model usability and integration with existing workflows.	2026-01-23 17:59:57 +01:00
Prince Canuma	02bfa228d9	Refactor weight loading and sanitization processes for audio models	2026-01-23 17:31:25 +01:00
Prince Canuma	f6e0e5d5a4	Update av_ca_timestep_scale_multiplier to 1000 in model configuration for consistency across modules	2026-01-16 15:59:22 +01:00
Prince Canuma	a658911f98	add audio	2026-01-16 01:15:22 +01:00
Prince Canuma	d1ca36a315	initial commit (LTX-2)	2026-01-11 23:48:33 +01:00

11 Commits