VideoLM: My Absolute Cinema Factory
I conceptualized this mid-last year (not this repo/project) and later migrated the core to TypeScript to truly harden the architecture.
This full-stack video pipeline digests custom sources (PDF…
VideoLM: My Absolute Cinema Factory
I conceptualized this mid-last year (not this repo/project) and later migrated the core to TypeScript to truly harden the architecture.
This full-stack video pipeline digests custom sources (PDF, DOCX, PPTX, TXT, MP4) to craft accurate visual narratives and commercial YouTube videos automatically.
The Factory:
- Brain: Python/MCP bridge with NotebookLM extracts facts and generates audio overviews from uploads.
- Vision: Gemini 3 maps research into a 10-scene storyboard with aesthetic steering.
- Muscle: Hardened NestJS & FFmpeg backend handles b-roll sync, audio ducking, and branding.
Built to survive in VMs and Free Tier limits.
Read the repo’s README.md and docs/ to see the real architectural sweat under the hood.
AI tools were supportive senior mentors (like I’d say). Architecture, coding, and for sprint the final ship Ohh (almost forget) and for translate for English was I wrote in some parts
Gemini CLI
: Used in my dev environment. I didn’t know FFmpeg math or Docker when I started. It explained real-time errors, suggested container fixes, and guided me through deployment limits.
Perplexity**: Used to hunt updated docs and research DevOps best practices. It helped me learn Docker patterns, FFmpeg tricks, and tech stack comparisons to harden the core.




. The goal is to make sure the DEMO doesn’t break if a user doesn’t have a JWT token
. I’m building a fallback mechanism so the



Nest which already helps a lot, but depending on how many projects and things you need to host, it might not be enough… Thank God I recently created my AWS
account and managed to create VPSs to deploy my projects. I still have some free trial dollars
. I think it’s enough for 20 days or more. I didn’t mention it, but their dashboard and resource sections are very intuitive and easy to use. I liked it a lot! 


doesn’t lose track of the artifacts when the background worker picks up the heavy renders. Hardening
the persistence layer so can poll the status properly.
the reality is I was drafting the
how to pass the NotebookLM facts into the Gemini script generator.
can extract the exact facts we need without timing out the server. IT’S IN THE IMAGE IS REAL, AND YOU CAN PRATICE THESE THINGS TO SEE GOOD RESULTS. At least for me it will… and helped-me a lot)



right now. I’m hitting the endpoints manually
to ensure the payloads match the expected JSON structure before the orchestrator fully takes over. 
… Good thing I’m studying Computer Engineering. The visual pipeline is throwing syntax errors, so I’m prompting the CLI to find the exact bottlenecks before we scale. 
and planning our final moves. I was mapping out exactly what needs to be refactored in the 

lab logic. By hardening this bridge now, we ensure the frontend won’t break when we fully plug in the heavy video generation later.
) the DB connection lifecycle, I’ve made the system
, not just slide-shows (how it was a while ago)





