This week, I learned:
- Karya.in is creating high quality datasets. Suhel mentioned them
- An 8-year old uses Cursor.ai to code
- Hermes 3 has special tokens like
<SCRATCHPAD>, <RESTATEMENT>, <THOUGHT_*>, <PYDANTIC_SCHEMAS>, <SCHEMA_*>, <REASONING>, <INNER_MONOLOGUE>, <PLAN>, <EXECUTION>, <REFLECTION>, <THINKING>, <SOLUTION>, <EXPLANATION>, <UNIT_TEST>, etc. This extends the capability dramatically. - Lumentis creates docs from transcripts and text
- LLMs write worse code in JSON than Markdown
- Copilot’s system prompt calls a
search_enterprise(query: str)tool and ahint(M365Copilot_language: str)tool as assistants. - Anthropic Prompt Caching is 90% cheaper to use and 25% costlier to create. So if there’s a 27% chance it’ll be re-used, cache it.