This week, I learned:
- Wrong answers are useful if you discover why they said that. Conversation is a game where you CO-CONSTRUCT common ground. Mike Caulfield
- BMTC hourly data from Bangalore Metro is available via RTI. Vivek
- “Find evidence for and against” improves LLM responses far more than “Are you sure?” Mike Caulfield
- SSH3 is an emerging SSH alternative that’s written on top of HTTP/3. It supports OAuth2, OpenID Connect, and HTTPS for certificates.
- Cholesterol has become a victim of its own success. We give statins to those with high LDL. So most people who have heart attacks have lower-than-natural cholesterol. Inflammation (HS-CRP) is now the strongest predictor of heart attack (American College of Cardiology). The usual stuff reduces HS-CRP: no sugar/carbs, veggies, nuts, green tea, turmeric/black pepper, weight loss, exercise, sleep, meditation.
- ⭐ The beginner mindset: scrub your instincts and don’t let life experience cloud you. This takes effort. Hold on to naivette and escape cynicism. The Knowledge Project: Barry Diller
- Forecasts give comfort. They may not be good but they feel safer than instinct. The Knowledge Project: Barry Diller
- My laptop’s mic is much better than my phone’s mic, surprisingly. When recording conversations, it’s better to leave my laptop open and record than use the phone’s recording app.
- ⭐ Here are the major not-immediately-obvious LLM megatrends/superpowers I see.
- Swarms. Ask for dozens of solutions in parallel. Merge, rank, auto-debate, converge.
- Personalize at Scale. Create feedback, designs, excerpts/summaries, … tailored to EACH person at scale.
- Computer use. Agents operate UIs like a human (browser, apps).
- LLM-as-a-judge. Use AI to validate ever-increasing AI generated output.
- Synthetic data. Create realistic data for prototypes, testing edge cases, market research simulation, training data, …
- Code on demand. Ask for outcomes directly. Agents code on the fly to get there, in data science, research, management, …
- Style transfer. Copy a master’s style of drawing, coding, writing, … creating an army of their apprentices.
- Multi-modality. Native voice/video/screensharing and long-context perception
- Citizen experts. Non-expertise is not a barrier. Amateurs can create expert-level films, music, software, reports, …
- Long-context LLMs. Growing context size lets us process entire repos, legal libraries, personal lifelogs, …
- Memory. Assistants learn per-person / per-team. Cuts prompt, builds knowledge.
- Agent-to-Agent. Agents consuming content (e.g.
llms.txt), agents calling agents (sub-agents, A2A protocol, …) - Real-world tools. Write reports, send emails, shop online, use computer, control devices, …
- Jagged frontier. AI is great at certain things but terrible at others. This frontier is unknown and shifting rapidly.
- Lethal trifecta. You can only have 2 out of these 3: private data, untrusted content, and external communication.
- Edge/Private AI. Small models on private cloud compute.
- Authenticity. What content is authentic? What’s slop? What’s fraud? Are AI twins liable?
- AI Governance. Strict liability, transparency mandates, state control, …
- Not sure about or haven’t seen enough of these:
- Data / workflow as the moat
- AI native business models
- AI digital-divide
- ⭐ What I’d like to do next, maybe, is build a boutique “AI Studio”. Small group of good people coding delightful AI problems. Something that doesn’t scale.
- GLM models can be used with Claude Code. At $3/month and a quality close to Claude 4 Sonnet, this is a good deal. But the effort of adding a new subscription is too high for me. I’d rather use it via OpenRouter which is doesn’t support an Anthropic API end point at the moment.
typstis a good LaTeX alternative. Markdown-like syntax with fast rendering. Mostly useful for researchers using LaTeX. But publishers / journals don’t accept typst often.libSQLis an SQLite compatible fork with remote access, replication, ALTER TABLE to modify columns, random ROWID, etc. It supports the same externsions. The maintainers are working onturso- a SQLite compatible improvement with async, vectors, change data capture, etc. (still in alpha). But because of this, I’m a bit uncertain about the future oflibSQL.- ⭐ LLM benchmarks show a correlation of ~0.5, hinting at a common theme of intelligence. Correlations in coding & science are particularly high. Ethan Mollick. Reminds me of student marks correlations. Strong correlation clusters (physics, chemistry, biology, mathematics, computer science) with the weaker correlations going down to ~0.5. What does it indicate? LLMs learn like people? Knowledge areas cluster? Humans write benchmarks like exams?
- Dayflow records your screen at 1 fps and uses Gemini to summarise your activity every 15 min. Has low CPU usage.
- ⭐ Code Mode is a smart way to use MCPs and a very likely future direction. Using LLMs to write code to call MCPs rather than directly.
- Cloudflare supports an AI Index which will eliminate the need for a lot of custom RAG engineering.