Things I Learned - 23 Mar 2025

This week, I learned: If we can DESCRIBE what good looks like, training data is no gap. We can auto optimise models towards that. That’s RLF. DeepSeek R1 side stepped the need for training data by creating reward functions and prompts. This tells the fine tuning process how to go correct as it goes along. This video is the first one that really help me understand what’s going on. I was born in the Ananda year in the Tamil and Telugu calendars. ChatGPT Andrej Karpathy’s note taking mechanism is similar to mine, except I use Microsoft TODO. Ref I have 3 categories. Things I learnt, which I just note. Things to explore, which I can delegate, defer, drop, or do at any time. Things to do, which are the hardest and pile up. Alexander Doria shares an interesting perspective on the app space. Model is the product Models are natively absorbing app capability and will become killer systems internalising workflows like Chat, Deep Research, Claude Code, Operator, etc. to wipe out the apps and workflow space. Models will “internalize” tool capabilities Opinionated or focused training will be a lever and model providers will acqui-hire the successful trainers API access from model providers will shrink. Selling tokens is not a viable business model given lowering costs The huggingface_hub cache-system uses symlinks by default to efficiently store duplicated files. To support symlinks on Windows, you either need to activate Developer Mode or to run Python as an administrator. In Windows, you can enable offline files for any SMB share via: Control Panel → Sync Center → Manage offline files and turn on the feature. Then, in File Explorer, right‑click the mapped network folder or drive and select “Always available offline.” OpenAI now supports PDFs natively in the API. (Gemini has done so for a while) Anger is a trigger for change. “Either change yourself or the environment, else you’ll be uncomfortable.” HocusPocus allows live collaboration e.g. editing together Block notes is a notion like library for editor components. Converts to Markdown Oxidizr enables replacing Linux tools with Rust equivalents. Emoji Kitchen lets you create stickers from emoji combinations. Another way of scaling LLMs is generating multiple options and self evaluating. Eric Zhao duckdb -ui launches a DuckDB notebook. This is built into newer DuckDB releases Monolith downloads web pages as a single HTML file by embedding content. Archgw is an LLM proxy/router from the makers of Envoy proxy. There’s an annotated Terry Pratchett! Gemini API allows YouTube videos as a part. Google agents.json is a proposal for discovery of agents on a site that enhances the Open API spec: wild-card-ai/agents-json Since Gemini Flash 2.0 is now an image GENERATION model, interactive VISUAL fiction is now a cool possibility. People are using it in interesting ways: Interleaved storytelling, Memes, Surrealism.