Prince Canuma
576e01da14
Implement linking of text encoder and tokenizer directories in conversion process. Enhance error handling in LTX2TextEncoder for tokenizer loading, providing a fallback model if the specified path is unavailable.
2026-03-09 18:25:32 +01:00
Prince Canuma
9f37dab076
Refactor model loading in generate.py to use dynamic model paths for audio and video components. Simplify weight loading logic in LTX2TextEncoder to accommodate both monolithic and reformatted model structures. Introduce a check for existing model paths in get_model_path function to enhance robustness.
2026-03-09 15:51:21 +01:00
Prince Canuma
4cd58f8b26
Refactor LTX2TextEncoder to utilize Rich for progress tracking during token generation. Replace tqdm with Rich's Progress for enhanced console output and user experience. Clean up imports and streamline the generation process.
2026-01-19 02:13:10 +01:00
Prince Canuma
883c6b0ad8
ensure dtype cast
2026-01-17 13:03:48 +01:00
Prince Canuma
e4cdbb7eab
add vae tiling
2026-01-17 07:51:54 +01:00
Prince Canuma
5f86e881d7
Update top_p parameter in sampler function to 1.0 for enhanced sampling control in LTX2TextEncoder
2026-01-16 21:08:14 +01:00
Prince Canuma
a658911f98
add audio
2026-01-16 01:15:22 +01:00
Prince Canuma
81daf3f67d
Add prompt enhancement feature to video generation
...
- Introduced `enhance_prompt`, `max_tokens`, and `temperature` parameters in `generate_video` function for improved prompt handling.
- Implemented prompt enhancement logic using the new `enhance_t2v` method in the text encoder.
- Added command-line arguments for prompt enhancement options.
- Created new system prompt files for T2V and I2V generation to guide the enhancement process.
2026-01-15 14:31:00 +01:00
Prince Canuma
f5134fa172
adjust gelu and precision
2026-01-15 12:49:21 +01:00
Prince Canuma
957093c29b
use numpy for improved float64 precision and performance
2026-01-14 00:03:00 +01:00
Prince Canuma
74af04718d
Remove commented-out code and clean up text encoder initialization
2026-01-13 23:31:54 +01:00
Prince Canuma
ea063f7550
Cast LM weights to bfloat16
2026-01-13 23:30:26 +01:00
Prince Canuma
fc6ef20c1b
Add custom text encoder with quantization
...
Co-authored-by: HimanshU Mourya <40685364+codingstark-dev@users.noreply.github.com >
2026-01-13 22:56:51 +01:00
Prince Canuma
01d895bc77
Add frame number validation in video generation and update Gemma3 text encoder to use validated mlx-vlm implementation
2026-01-13 17:12:11 +01:00
Prince Canuma
7114b023bd
- Refactor video generation script
...
- Introduced argparse for parameter handling, streamlined model loading, and enhanced denoising functions.
- Updated VAE weight sanitization for compatibility and improved activation function handling in text projection.
- Added support for saving individual frames and refined output video generation process.
2026-01-12 14:04:53 +01:00
Prince Canuma
d1ca36a315
initial commit (LTX-2)
2026-01-11 23:48:33 +01:00