From e3bda17c7d8100d4c59d09083cafc403ec5197cb Mon Sep 17 00:00:00 2001 From: Adolfo Reyna Date: Thu, 26 Feb 2026 21:26:34 -0500 Subject: [PATCH] Complete project history with Phase 6 --- history.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/history.md b/history.md index 06f8d78..560d283 100644 --- a/history.md +++ b/history.md @@ -46,3 +46,9 @@ The project now features a high-performance, Apple Silicon-optimized pipeline th - Refactored the script to support a dictionary of multiple `MarianMT` models. - Each transcribed English segment is passed through each loaded translation engine sequentially. - **Performance on M2:** Loading 3-4 specialized models + Whisper is highly efficient, using ~1.5GB of RAM and providing near-instant results. + +## Phase 6: Memory & Generation Safety +- **Issue:** Occasionally, long inputs or model glitches caused "runaway" translation generation, which could consume excessive memory. +- **Solution:** + - Artificially truncated input transcription to a maximum of 250 characters. + - Added `max_new_tokens=150` to the translation generation call to ensure the model terminates even if it gets stuck in a loop.