This week, I learned:

  • DuckDB supports function chaining
  • DuckDB lets you create functions = macros
  • HTML for People is a nice introduction to HTML.
  • FlightRadar24 lets you watch airplanes live.
  • sq is like jq but for SQL.
  • Deno 2 is fully backward compatible with Node! via
  • O1 is good at solving problems where the solution is easy to verify and generating options helps get closer to the solution
  • Reverb ASR does diarration as well as transcription. It seems the state of art right now.
  • Gemini Flash and Gemini Flash 8b can be fine-tuned at zero cost. Inference is at the same price! Ref
  • Flux 1.1 Pro is released. I tried my Calvin & Hobbes test on it. Not great. ImageGen3 is better, ChatGPT is the best. Ref
  • Revisiting text to speech models. Nothing much has changed since July 2024.
    • OpenAI TTS: $15/1M chars Ref
    • Deepgram Aura: $15/1M chars Ref
    • Azure AI Speech: $15/1M chars Ref
    • Google TTS Neural2: $16/1M chars Ref
    • AWS Polly Neural TTS: $16/1M chars Ref
    • Cartesia Pro: $50/1M chars Ref
    • Elevenlabs Scale: $300/1M chars Ref
  • GitHub co-pilot workspaces let you code using your mobile with AI and deploy it at one shot
  • If you need an Ubuntu Docker container with Python, install it via uv rather than compiling from source. via
  • VTracer is an open source library (and tool) to convert raster images to SVGs. via
  • If you want to create a console.llm() function, a browser extension is the best way, because some pages have Content-Security-Policy that block eval, form submission, fetch from other domains, and script execution.
  • PyPi lets you publish from GitHub Actions without a token. Also from Gitlab.com CI/CD and Google Cloud.
  • ActiveState which made ActivePython, ActivePerl, etc. made these products paid for commercial use around 2013 after a series of acquisitions.
  • Marimo supports:
    • Publishing any notebook to static.marimo.app as a static app
    • Creating a SINGLE link that embeds the ENTIRE notebook in the URL!
    • Runnable via uvx marimo edit
  • Parables on the Power of Planning in AI: Giving models about 30 seconds of thinking time consistently improves results - as much as increasing parameter size by a factor of 1,000 to 100,000!
    • This works particularly well for verifiable results (code, math, etc.)
    • Technique: Ask an LLM hundreds of times at low temperature and pick the most common one. (Google’s Minerva used this on the MATH dataset.)
    • Better Technique: Ask an LLM hundreds of times. Pick the best solution based on an evaluation metric (reward model)
    • Better Technique: Apply a reward model at EACH step of the process. OpenAI’s “Let’s Verify Step by Step”
  • Late chunking is an interesting approach to adding context to embeddings. (I don’t understand it, but it’s cheap and effective.)
  • DeepInfra offers embedding models as APIs at about 0.5 to 1 cent per MTok in an OpenAI compatible API. It also supports text-to-image models like flux.dev and speech recognition models like Whisper.
  • Jake Heller:
    • “One of the things we learned is (an LLM app) after it passes passes frankly even 100 tests, the odds that it will do, on any random distribution of user inputs, the next 100,000 100% accurately is very high.”
  • OpenAI’s O1 is like Daniel Kahneman’s System 2 thinking - as against other LLMs’ System 1 thinking.
  • Continue.dev is another AI coding editor. It supports OpenRouter. So now I have heard good things about:
    • Github Copilot
    • Cursor
    • Cody
    • Continue.dev (supports OpenRouter)
    • Aider (supports OpenRouter)
    • Maybe:
      • Codeium
    • Not:
      • Amazon Q Developer