This week, I learned:
- “People are great at feedback on what you are doing wrong. They are not so good at telling you how to fix it. They don’t know you that well.” Amit Kapoor
- Perfect Cursors makes periodic cursor positions animate smoothly by interpolating on a spline**
- CloudFlare and Vercel now support sandboxes where you can execute code. The price is not so low that we can execute for free in bulk but works well infrequent or batched code execution. Simon Willison
- Here’s how I’m using ffmpeg for video recording & editing.
- To record screen at 5 frames per second, I run an abbreviation
screenrecordwhich maps to:
- To record screen at 5 frames per second, I run an abbreviation
- Gemini CLI has a generous free tier and uses Bootstrap over Tailwind Ref #ai-coding
- Cloudflare has a native agents SDK that looks good, especially for CloudFlare users. Ref
- There are several brands with recognizable chart style guides. It’s possible to generate style guides for these from the charts, but applying them via matplotlib is almost #impossible today. ChatGPT
- Hyperfine is like %timeit for the shell. Written in Rust
- ⭐ Vertical AI is a moat against AGI. Specialization reduces hallucinations. Custom workflows and regulations are sticky and defensible. We need to start selling to users, not IT, though. Ref
- When AI automates a task, the bottleneck shifts. AI process re-design is about reworking the process around the new bottleneck, and iterating quickly.
- With coding, it’s testing, reviewing, deploying, use-case identification.
uvx git-smart-squashre-organizes haphazard commits using LLMs. git-smart-squash #ai-coding- GitHub offers a free Docker container registry. Simon Willison
- There are three major areas where humans either are, or will soon be, more necessary than ever: trust, integration and taste – NYT. Anil. To deal with this:
- Learn things that might grow in importance, like:
- Data modeling
- APIs
- Code reviews
- Drawing and 3D modeling
- Narrative storytelling
- Design
- Movie making
- Statistics
- Sceptical fact checking
- Continuous AI auditing e.g. awesome-continous-ai or automated-auditing
- Zero knowledge proofs
- Homomorphic encryption
- Privacy-preserving computation
- Fingerprinting and watermarking
- Governance frameworks
- Ethics and AI dilemmas
- Negotiation
- Change management
- Remote working, management, hiring
- Creating attention scarcity
- Local cultures
- Work with people of growing importance
- People designing products in regulated industries
- Cross domain experts
- Art developers, game makers, designers
- System thinkers. Economists, ecologists, system planners. People who look for second order effects.
- Live in cities that might play a bigger role in the future
- Cities like Singapore and learn how it builds civics trust, creates digital IDs.
- Cities like Bangalore and Hyderabad and learn how they grow tech talent
- Creative cities like Paris, Seoul, Mexico City, Berlin, etc. on sabbaticals to taste hubs
- Try to:
- Build auditing credentials and IP
- Audit your calendar for what AI can do. Have it interview you
- Practice sceptical fact checking and audit
- Learn things that might grow in importance, like:
- A clever way to test a library’s quality is to have LLMs write code from docs and test it. Failing libraries have flawed code/docs. Improve. Ref #ai-coding
- Common Pile is an 8TB open dataset for LLM training that includes ArXiv, PubMed, StackExchange, GitHub, IRC, Regulations.gov, Patents, UK parliament, books. Easier than scraping.
- A useful way to have reasoning models do deep-research-like work is to have them “First, create a plan to solve the problem, clearly listing the objective, approach, and output. Then follow the plan.”
- DE-COP is a method to check if LLMs were trained on private content. GPT-4o was trained on O’Reilly books, based on this method. Ref
- LLMs are more persuasive than humans. But repeated exposure reduces the effect. Ref
- Phoenix.new uses live views to publish apps as it codes. The testing framework looks at the screen while it codes and fixes errors. It commits every change
- Anthropic system prompt asking Claude to pursue its goals led to self preservation behavior. Ref
- The hungrier I am the better the food tastes. A good reason to eat less quantity and frequency
- You can purge the jsDelivr cache manually. Helps if you released a new version of a package and way to purge an alias (e.g.
https://cdn.jsdelivr.net/npm/your-package@1) - XConvert is a convenient online app to compress .webm videos. Not great design but fairly good compression.
- You can draw a treemap of import times via
python -X importtime app.py > timing.txtand then paste them at https://kmichel.github.io/python-importtime-graph/. - PyOpenLayers adds interactive mapping via OpenLayers to Marimo and Jupyter.
- In a TechCrunch interview with Jared Kaplan has was asked if Anthropic is becoming less safety conscious because they released Opus 4 which blackmails. Kaplan replied that they have stronger testing and higher transparency, so they’re more likely to share AI dangers early. Great positioning! Conversations are about perspective change and this nailed it.
- The system prompts for Anthropic misalignment evals are a fascinating read.
- AI PR Watcher tracks GitHub pull requests from Codex and other LLMs. Codex is way ahead of anything else on volume and success rate. Devin is next on volume, Cursor is next on success rate.