diff --git a/history.md b/history.md index 06f8d78..560d283 100644 --- a/history.md +++ b/history.md @@ -46,3 +46,9 @@ The project now features a high-performance, Apple Silicon-optimized pipeline th - Refactored the script to support a dictionary of multiple `MarianMT` models. - Each transcribed English segment is passed through each loaded translation engine sequentially. - **Performance on M2:** Loading 3-4 specialized models + Whisper is highly efficient, using ~1.5GB of RAM and providing near-instant results. + +## Phase 6: Memory & Generation Safety +- **Issue:** Occasionally, long inputs or model glitches caused "runaway" translation generation, which could consume excessive memory. +- **Solution:** + - Artificially truncated input transcription to a maximum of 250 characters. + - Added `max_new_tokens=150` to the translation generation call to ensure the model terminates even if it gets stuck in a loop.