This week, I learned:

  • LanceDB is a more scalable alternative to ChromaDB. Written in Rust. Does not require a separate HSNW library.
  • Meta has a bunch of image embedding models:
    • DINOv2 creates image embeddings (Apr 2023)
    • ImageBind is an embedding model for text, images, audio, and more (Jun 2023)
  • Gemini has a code execution API!
  • 0x0.st is an open API-based file upload + URL shortening service. You can dump files there temporarily.
  • noVNC is a JavaScript VNC client. You can control a remote (virtual) machine from your browser.
  • Friend is an always recording pendant that you can ask questions to.
  • Anthropic’s new Sonnet model is even better at code. Plus it has the ability to extract coordinates from images. Ref
  • Gemini sort-of supports diarization. Ref. I tried it and it’s OK but not perfect.
    • #IMPOSSIBLE LLMs cannot diarize reliably yet. (Gemini just guesses the speaker differences.)
  • Replit is good for hobbyists, Cursor for developers, and Pythagora & Bolt for non-developers building business apps. Ref