This week, I learned:
- When running a Hello world app:
- FastAPI takes ~26K RAM, 3% CPU
- NodeJS + Express takes ~62K RAM, 2% CPU
- Deno + Express takes ~62K RAM, 1% CPU
- Deno + Fresh takes ~54K RAM, 0.4% CPU
- I was testing out different video LLMs:
- Luma Labs lets you create videos from text
- Runwal ML lets you create video from an image + text
- Viggle lets you add images to a video or move a character in a certain way
- Veed.io is a video editor that offers AI video editing features
- Deepmotion generates 3D animations from video
- Wonder Dynamics may be similar to DeepMotion
- I tested out a few audio LLMs:
- Reflection 70b is one of the top models now, and is open source!. It works by making the LLM reflect on its answer inside
<reflection>...</reflection>tags. - The best diarization model today is whisperX. Run on Colab T4 GPU with:
- Scale’s SEAL Leaderboards seem fairly good.
- coedit-xxl is Grammarly’s fine-tuned google/flan-t5-xxl model run on CoEdit - text editing dataset. It’s mainly for single-line editing, though, and far from a full-document or full-email zero-shot editor.