Things I Learned - 30 Jun 2024

This week, I learned: Amara’s law: “We tend to overestimate the effect of a technology in the short run and underestimate the effect in the long run.” LLM Patterns include Evals, RAG, Fine-tuning, Caching, Guardrails, Defensive UX, Collect feedback. Notably: Defensive UX: Microsoft, Google, and Apple have guidelines for Human-AI interactions Collect feedback: Explicit and implicit Rouge and Context Precision are metrics to evaluate LLM responses that serve as a starting point – but not sufficient, usually Any word with the letters izehsglbo can be spelt on a calculator. That includes Hobbes (538804)! Via Calculator spelling Tor Browser + DuckDuckGo is good for torrent searches. Maybe the Dark Web IS the original Internet. The ad-free hacker web

Things I Learned - 23 Jun 2024

This week, I learned: Luma Labs Dream Machine generated videos. It’s free and is of reasonable quality. Update: 6 Jun 2025. Costs $10/month LLM DataHub has LLM training datasets, regularly updated From Dan Becker on running a workshop Answer questions at the end, not in parallel in a chat, to avoid distraction Have fewer words in slides when presenting. It’s less distracting Morgan Housel Shane Parrish podcast Risk is what stops you from achieving YOUR goals. What’s risky for me may not be risky for you The lesson from compounding is that you want to optimize for duration, not return. That’s what does the heavy lifting. Survival, consistency, long term - these matter. The performance does NOT matter.

Things I Learned - 09 Jun 2024

This week, I learned: httpretty can mock ALL Python HTTP libraries Japanese pray to dead parents instead of gods. The dead are preserved in plates by priests. Japanese are generally non religious Looks like GPT-4o is using CNNs to create vector embeddings of images, with images gridded into a 1x1, 2x2, etc. PLUS OCR. Ref The sum of a sinusoidal series is like a spirogram. Spinning circle linked to another and so on https://www.andreinc.net/2024/04/24/from-the-circle-to-epicycles

Things I Learned - 02 Jun 2024

This week, I learned: Modal.com seems of offer reasonably priced GPUs Combining vector search and keyword search with reciprocal rank fusion seems to work well for RAG. Ref Knowledge Project podcast. Morgan Housel Differences of opinion exist because of different stories arising from origins and experiences. We are not debating facts. We are debating life lessons! Solution: hear their anecdotes. The stories that taught them their lessons. AI reporting templates are a trend. Domain expertise comes in via structuring the report template and associated prompts. Some audio embedding models: unoti/voice-embeddings, retkowsky/audio_embeddings, pyannote/embedding (for speaker similarity), and more. Hidden Brain podcast: Innovation 2.0: The power of less Subtraction is hard because we are biologically and economically wired against it. It’s also hard because there are fewer markers of subtraction. Additions are natural markers / triggers. Marie Kondo suggests keeping only what sparks joy #POST I tried Undermind.ai - an agent that researches for you. It guides you to ask a detailed question, spends 2-3 minutes finding the answer, and provides detailed results. But it’s worth the wait. It’s a good alternative to quick validations on SciSpace. For popular results, search actually makes results worse! When not to trust language models Perception of fluency and usefulness are NEGATIVELY correlated in LLM! Evaluating Verifiability in Generative Search Engines GPTs are now available to non paying users. Apparently for a few weeks! Everyone also has limited access to GPT-4o. Discussion with Anand Explore BBC Microbit Everyone should get a Raspberry Pi! Watch 2 minutes paper on YouTube More LLM routers: LiteLLM: Open source, OpenAI compatible, 100+ LLMs RouteLLM: Open source, OpenAI compatible, automatically routes based on cost OpenRouter: OpenAI compatible API, several models Unify: Supports many models Portkey: Supports popular providers Martian: Limited set of models d-id and Heygen can modify videos of a person.

Things I Learned - 26 May 2024

This week, I learned: My home WiFi is on WiFi 6. This supports beam-forming which increases range by “focusing” on devices! Predibase lets you run fine-tuned models at the same price, on a per-token basis. 25c/MTok up to 21B models. That’s sames as Claude 3 Haiku, but with fine-tuning. RunPod’s vLLM endpoint lets you run any HuggingFace LLM with an OpenAI API priced on usage (serverless) not on idle time. “Autoscaling to 0”. Portkey is an LLM router

Things I Learned - 19 May 2024

This week, I learned: In Scandinavia, Århus comes after Zürich because Å is a different letter. It was added by the Dutch after WW2 to distance themselves from the Germans. via Zalgo text is where we combine multiple Unicode combining characters Artificial Analysis benchmarks LLM APIs on speed, cost, and quality.

Things I Learned - 12 May 2024

This week, I learned: Radio free Xp podcast. Nudge 61 always announce first before doing. Give people time to plan comment and react. That gets you alignment without sacrificing freedom. give information, not orders. When someone is parking a car, tell them how much space they have, don’t tell them to start stop or how much to turn left it’s almost impossible to change the culture if you’re not the boss

Things I Learned - 05 May 2024

This week, I learned: Hidden brain podcast. Innovation 2.0 solve your own problem. Don’t solve other people’s problems. This helps you pick what you’re good at affordable losses. Make sure you survive borrow others’ spares. spare time, scrap data, anything others don’t use. If you can monetize it, you can pay them back focus on the controllable. Ignore what’s outside your control don’t even waste time on it curl supports globbing, emails Beetrove is a ranking of the popularit of OpenAI GPTs Gemini Prompt Guide has detailed examples of how each role can use Gemini ESLint’s new flat configuration does not support package.json

Things I Learned - 28 Apr 2024

This week, I learned: Tough prompt to test: Gr brx vshdn Fdhvdu flskhu? is a quick way to assess LLM capability. Ref Cheap cloud GPU services thread on Twitter lists: Runpod (17) Vast.ai (17) Modal Labs (8) fly.io (4) LightningAI (4) Colab (4) AkashNet (4) Lambda Labs (4) ShadeFormAI (3) Mac Mini (3) Tensor Dock (2) Hetzner (2) BrevDev (2) JSR lets you publish Deno packages that can be imported by npm via. It also auto-evaluates documentation and scores it! via Snowflake Arctic Cookbook explains how mixture of experts models work A long list of LLM courses online Embeddings can be averaged. So, to embed large documents, average the embeddings of their chunks! OpenAI suggests this.

Things I Learned - 21 Apr 2024

This week, I learned: Effort engine introduces “effort” as a parametrizable way to speed up LLMs with a quality trade-off. Works on Mistral for now. Many arts demand devotion. Devoting unrestricted time is part of that. 16 hours of practice a day is not uncommon. Sessions don’t start and end on time. Instruments take a lot longer to learn than vocal music. The instrument needs to become an extension of you. Tests and homework have a purpose. It helps people figure out whether they’ve learnt. So: Write tests that make people think! Like DuckDB workshop Share a list of exercises that people can explore People need to explicitly be INVITED, and potentially IN PERSON, before they will engage with something new. For example, no one posted to [email protected] until the VIA Talks session where we got them to post. For example, having one day at IITM mandatory (especially early in the course) gets online students familiar with TAs. They understand that TAs actually help, at high quality. That they can use Discord. What makes Delhi students more assertive? How can we inculcate that in others? jsr-io/migrations is a great example of database migrations. Shape Detection API in the browser detects QR codes, face bounding boxes, Browsers also natively support blurring and face tracking. via Lessons after half a billion GPT tokens for GPT-4: Vague instructions are better than over-specifying Avoid libraries like Langchain. APIs are stabler 1 token = 3 characters is good enough GPT4 doesn’t hallucinate much, except it does a poor job of saying “I don’t know” or “There’s no such data” (the null hypothesis) Keep the output down to 10 items or so if you’re listing. For longer lists, have it explicitly enumerate Don’t worry about niches. Just wait for GPT5 #WRITE GPT clearly prefers 42 as a random number. #WRITE fal.ai “animates” pictures, creating videos. It made one from my talk. I morphed into various somewhat similar people rapidly in a 2-second span. Very promising, and far from good. llmsherpa extracts PDFs using LLMs. It has errors but it preserves hierarchy, extracts tables well, and retains image coordinates. Via +91 90031 35354 ~Vetrivel PS www.web.sp.am is a content farm that’s getting hit by OpenAI. Highlights how easy it is to create content farms, and therefore “easy” it can be to introduce bias into LLMs. OpenAI supports batching requests. Didn’t know that. Marvin provides Python decorators to create AI functions. Pretty intuitive! Outlines generates structured test with LLMs. It uses the ⭐ logit_bias trick to limit choices in output. See get_choice() Lemur from Assembly.ai does real time call transcription and summary W3C is exploring ways to allow web pages to train LLMs, to flag content as AI generated, etc. Data Provenance Explorer lists open datasets used to train LLMs. Summarize.tech summarizes YouTube videos. #WRITE Stable Audio 2.0 generates 3 min of music from a prompt. I tried Bollywood Tamil film background music. Dark, soulful and Horror movie background. Drums starts darkly. Build up to a crescendo of intense chaos.. Great that it managed, but not great music. Somewhat stereotyped. I need to learn how to prompt better. BTW, Udio is another such. Harpa.ai is a well designed Chrome extension / plugin that can chat with or automate any page. Due to in-context learning, giving 100s of examples in the prompt can teach LLMs to jailbreak. Ref With RAG on search becoming big, search APIs are growing. serper.dev, you.com, searxng being examples.

Things I Learned - 14 Apr 2024

This week, I learned: Prashant Pandey: we need to prepare before every meeting. Something to teach VS Code Select any code and command Explain this to understand the code %something in command bar searches ACROSS files for a term. Exactly like Ctrl+Shift+F Copilot has an Inline Chat: Start in Terminal (that needed me to unbind Ctrl+I in bash to work) Ctrl+2 opens a second window on the side. Ctrl+1 goes back to the first window Terminal: Open Detected Link lets you scroll through detected (file) links in terminal Terminal sticky scroll is transparent. (But Terminal stick scroll isn’t working for me.) Copilot uses last 10 commit messages, Jupyter notebook kernel state (variables) as additional context 1.88: supports locked scrolling to sync scrolling of side-by-side windows fsspec is used by csvbase, Pandas, etc. to implement file system protocols like s3fs, gcfs, etc. SQLime is a SQLite client / playground on the browser! Do nothing. Then do less Humans have a bias against inaction. Hence a strategic advantage. What can you cancel today? Humans have a bias against subtraction or removal. That too is a strategic advantage. What can you remove today? Humans have a bias against constraints. That’s a strategic advantage. What constraint can you embrace? No Yay! When declining something, add it your calendar so that when the time comes you can say yeah I got this time back

Things I Learned - 07 Apr 2024

This week, I learned: CSS nesting is now available in browsers Cold starts in AWS Lambda: serverless functions stay alive for 5-7 min. All languages are fast but Docker is slow. More npm packages slow start dramatically. WiFi only works when it’s raining because a tree was obstructing the signal but was weighed down when raining! Good reasons why finding a technical co-founder won’t work. You want a unicorn to passionately trust YOUR idea after 2 meetings. Why should THEY risk money for YOUR idea? You’re the money guy. RAISE the money for YOUR idea! How passionate are you about software? And you want to build one now? This is a subtle vulnerability. ChatGPT hallucinated pip install huggingface-cli. Sosomeone created the package and got 30,000 downloads! Video-Llava is a video LLM MusicCNN-embeddings provides embeddings for music genre classification How I write podcast. Paul Graham essays Write simply. It helps communicate. (Don’t concise if communication worsens.). It forces you to make the idea better Do lame stuff. Else you won’t start. Low standards drive creativity The more to delete, the better your writing. Read your piece. Highlight what feels poor. Fix it. Ask friends to highlight what’s BORING? UNCONVINCING? Delete the first, brainstorm the second. Or ask, what’s the 10% to cut and 10% to keep. Write about stuff you don’t know above the. Writing GENERATES ideas Write about what’s BUS. GENERAL and SURPRISING. (Laughter is a sign of comprehension.) Do HARD things to cultivate taste. Spend more time with people who generate ideas in you. Ravi chithappa. Ram. Ankor. Ganes. Books! Build taste. I have a taste for picking technologies. Data visualization. Retrospect. Write down what you like and dislike. Copy what you REALLY like. Guilty pleasures. A benefit of lower standards is that it let’s you pick the path less travelled. ITERATE. Discuss ideas. Iterate. Acknowledge. ITERATE.

Things I Learned - 31 Mar 2024

This week, I learned: sqlite-schema-diagram generates schemas for SQLite databases using Graphviz TechEmpower web server benchmarks place Rust servers on top browser.new is a good example of a browser agent. It slowly but independently does a good job of achieving the result. Example: What crew is common in Ingrid Bergman - Cary Grant films? twinny is an open source VC Code Copilot alternative. typesense supports embeddings natively. Binary embeddings are good enough. Cohere releases binary embeddings. Extract.langchain.com is a poor early interface to featurize unstructured.io Hume.ai offers voice emotion API and emotion-based conversational responses. An empathic AI. Rust is non-trivial. Inspired by We are under DDoS attack and we do nothing, I “wrote” a small binary that serves a parquet file as JSON. It failed and I couldn’t fix it. spleeter is a better alternative to demucs. Splits audio into pyannote-audio does speaker diarization uvicorn is faster than hypercorn but hypercorn supports HTTP/2 and HTTP/3. FastAPI with uvicorn is reasonably fast. Representational engineering lets you control LLM output based on preference on the fly. When I set up a training: On inviting for DuckDB workshop on Sun evening, Gramener starts accepting immediately, Straive doesn’t. Straive has high spread of joining time. When joining Gitlab Pipelines Workshop, Straive starts meeting (e.g. Premlal) many minutes early. Gramener floods in (due to alert). Straive streams in slowly. Gitlab Pipelines Workshop acceptances: Gramener 47, Straive 100

Things I Learned - 24 Mar 2024

This week, I learned: Ways to expand mental models DISCOVER mental models. Review beliefs diary. DIVERSIFY. Find INFLUENTIAL (not dull) people with different backgrounds. Experiment! New environment, approach, perspective Be open. Change your mind. APPLY. Practice regularly Ways to use inversion “Pre-mortem” is an analysis at the beginning of how a project failed. Then avoid that “Red team” or “Black hat” are designated to contradict. Having a PoV IS a hypothesis. Always having a PoV allows us to detect anomalies and learn. Control vectors in real-time lets you control response in real-time OIDC is Open ID Connect. It’s like OAuth2 but more. Azure and Google support it. Planka is an open-source Trello There is a https://myapplications.microsoft.com/ that serves as a starting point. Might be helpful Instructor lets you create structured JSON output.

Things I Learned - 17 Mar 2024

This week, I learned: DuckDB is 2-10 times faster than Pandas. ClickHouse is supposedly faster but doesn’t run on Windows. Claude 3 Haiku input costs is $0.25/MTok. That’s half the GPT-3.5 cost. If it’s of comparable quality, it’s worth switching. But Claude 3 Opus is comparable to GPT-4 and twice the cost, so not worth it. Tavily is a search API for LLMs Interesting model garden models There are sites you TRULY cannot scrape even in the browser because of the isTrusted read-only property of events that you can never set to true. Oracle Service Cloud checks for isTrusted in mouse actions.

Things I Learned - 10 Mar 2024

This week, I learned: Mughals just replaced the top of most temples with Mosque domes as part of the conquer or die policy “Math is racist”. Because people who can’t solve it can’t because of their underprivileged background! Winners: commodity businesses, companies that own lots of data like Reddit and Stackoverflow, profitable bootstrapped businesses Making a tool more usable, e.g. a video, can have a 10-100X impact. Yet every developer thinks it’s redundant All in one podcast. Can Google save itself? the success of a developer platform is the number of people using it. But not everyone uses it equally. Some people create winning products which drives attention to the platform. Use llm proxy like that to measure weekly average users and cost saving through caching one week ago, if someone at Google stood up and said we have too many black people in our images, the responsible AI team would have shut them down calling them racist. They had too much power and it was a one-way conversation. With the backlash now, there is a lot more awareness and acceptance of the balance. Security is like that. It’s too easy to empower and shut things down until there is a backlash the lawyer’s job is to tell you what’s not possible. But like Travis, your job is to decide whether it’s worth the risk of running a taxi company without a license or not Americans pronounce Sundar pichai’s name as Sun Daar! data licensing has become a business model. Reddit, Accel springer, stack overflow and many others are licensing their content to Google and open AI for several million dollars.

Things I Learned - 03 Mar 2024

This week, I learned: You can use slots to stream HTML out of order! Shane Parrish. Short-term patience podcast have a frame of reference to relate EVERY experience to. That helps you evaluate (measure) and learn. That’s part of what Charlie Munger’s lattice of frameworks is about when there is a very high or very low interest scenario, low interest scenario then go ultra long term. Issued hundred years when the interest rate regime was very low short term optimal is rally long term optimal. So you need to learn to take a loss and look like an idiot to play the long-term game grit is a behavior that enables long-term thinking. Short term success gives you the luxury to think about long term #IMP power is about optionality. It’s about being in a position where you have the options that can affect the positive change rather than circumstances controlling you. Read Robert greene’s book on the 48 laws of Power low leverage enables that begin with the end in mind. Always how do you think about risk? Well, things do happen. It’s as simple as that autonomy and decentralization helps derisk do more and more of what works. That’s a powerful way of compounding long-term investments are better than frequent trading because you get to reinvest the tax you otherwise would have paid. So unless the alternative is super compelling, stay invested if you need to be the person who DOES the thing, you delegate less, leverage list, compound less, because you have to DO. BE A PERSON WHO SETS THE FIELD INSTEAD. The coach, the chess master, the director, patient strategist who Waits for the good hit Being in Control motivates #Lesson. my cycle tires were flat. I thought it was someone pulling out the air and felt very demotivated. But once I carried my cycle pump, I felt so much more in control and power and felt a whole lot better SourceGraph is the default platform for private code completion & search MetaVoice 1B offers voice cloning on American & British accents with 30s training Qwen 1.5 72B appears to outperform Mistral Medium, making it one of the top non-proprietary models Llava 1.6 is a substantial improvement over Llava 1.5 and slightly better than CogVLM, Qwen-VL AI scams are growing. Deepfakes scammed $34m. But voice fake for kidnapping is scarier. Buildspace’s demo is a great demo of how voice and actions can be used effectively. demucs does an EXCELLENT job of splitting songs into drums, bass, vocals and others

Things I Learned - 25 Feb 2024

This week, I learned: Architecture.md is an emerging standard Managing wealth requires training. htmz is a fantastic way to load HTML into elements! Suguna Poultry is Using robots to walk in their farms, use sound and bird eyes and movement to predict birth health over 1-2 weeks Light on the back of the bird’s back AND face => lays eggs in 14 days, else takes days later (girls and mobile phones?) Teknoturf is using Gen AI to Improve prompts when teaching prompt engineering. Pronounce languages better, identifying which words Tamilians and Malayalis will mis-pronounce. Explore IRBlaster. It can control AC and can automatically increase temperature at night. My view: LLMs are general purpose and more capable than SLMs. They’ll win, like CPUs won over special-purpose chips. GPUs will optimize for LLMs and as usage grows, cost will fall. Andrej Karpathy’s summary of sharp edges in tokenization uses tiktokenizer to explain: Why LLMs can’t be used for spelling Why LLMs are better at English than other languages Why LLMs are bad at math Why SolidGoldMagiKarp is a single token Why trailing spaces are bad Why YAML tokenizes more efficiently than JSON ssyoutube.com: Just add “ss” to “youtube.com” on the video and you can download YouTube videos Discussions with Sachin, AMAT Microsoft said Indigo, Air India uses LLM based bookings Meta invested $70bn in GPUs. Sam Altman is investing $7tn! NVIDIA has a price PREMIUM not discount for bulk GPUs! AMD is the next company to watch for Numenta - Subutai Ahmad - deploys AI models on CPUs #TODO Read A Thousand Brains by Subutai Ahmad Sanjeev Sharma Swaayatt Robots: Autonomous driving in India Deepeigen: Education Rohan Shravan, Bangalore. Likes sharing knowledge. Amazing teacher. IIT KGP 2008. Interested in exploring quantum computing Tresa Motors, Inkers App, The School of AI AMAT is working on photon-based computing. science research models. AI for science. Like Google: Deepmind Genome, Microsoft: Metagen quantum: AMAT is actively in into this. Nagapati Banda is driving this John Kelly is predicting a ChatGPT moment in quantum in a few years Adobe express has a forever free video to GIF converter Edge workspaces let me keep the same tabs open across laptops! Command line interface guidelines RAWGraphs has a custom charts API that is worth learning from Python fastcore has decorators like @typedispatch, Self, etc. All image-to-text models on HuggingFace wddbfs mounts SQLite as a file system. I had a bit of trouble, maybe with Python package versions. Google is using LLM powered bug identification HuggingFace Chat Assistants has open source system prompts!! OpenHermes training dataset is available. 1M prompts! Jio has made IPL free. They make money on data and ads. That’s Scale! Daniel Dennett outsources thinking to students. Reviewing his books. BUT: I don’t take feedback. When someone sends a pull requests, I ignore it.

Things I Learned - 18 Feb 2024

This week, I learned: Fine tuning makes economic sense only if the input tokens SAVED is twice the output token size on each call. Docker container memory usage on WSL2 docker stats frolvlad/alpine-glibc:alpine-3.17: 540KB ubuntu: 1MB (python3: +5MB) nikolaik/python-nodejs:python3.10-nodejs18-bullseye: 1.4MB (python3: +5MB) python:3-alpine: 612KB (python3: +7.5MB) python:3: 500KB (python3: +11.2MB) continuumio/miniconda3: 7.6MB (+6.5MB) Discussion with Vinu Yamunan Databuck by FirstEigen. Autolysis plus monitoring Quality council has the data steward (maintainer of each dataset) coming together with the uses on a weekly basis to understand what quality problems to users are facing. Data owners jaundice at a lower frequency to get an understanding #TODO Automate rules for data quality in our projects and intranet Convert a config rule into business language. Explain SQL. These are good use cases for llm’s Graph DBs are powerful for flexible data structures, but query generation needs AI or expertise. Check the Neo4J language cypher Explore storing SAME data in relational DBs AND in graph DBs / document DBs for different use cases Dallas rocketry challenge. Build a rocket that can take an egg to 800 feet exactly and land without breaking it Discussion with Karthik A #TODO Ask IIT students to do internship tasks. Use advent of code is a qualifying criterion Tata motors unionized DB admins for longevity. No one can take their jobs. Hires people who LIKE their jobs Rust gives me typing. It’s very efficient. Pola.rs is interesting but Pandas as good enough. Explore alerts from CCTV feeds. Karthik sends email alerts with pictures for: “Is the machine on or off”? for productivity “Are people not wearing helmets?” for safety at Cummins #TODO Integrate with WhatsApp. Use LLMs with function calling for responses Use expiring links (to pictures or content). It increases engagement Check Deno licensing. Is there a commercial clause? #ANS No - it’s MIT license Centre or excellence for zero emission tech at IIT. Karthik is part of it Explore auth0. 7000 users are free toml is part of the Python 3.11 standard library! If copilot writes code we don’t understand we are screwed. Hence expertise matters Discussion with Vikas Kedia #TODO Plan an AMA The mind becomes lazy with financial success. Vikas is treating his podcast as a startup Hire a professional videographer for your content Financial RoI in financial markets is the highest. Programming is high too but FS is even better “Performative power” – when you’re forced to perform, you get better ideas Observable 2.0 is an open source static site generator for data Python dataclasses SORA is OpenAI’s video generation model, and is stunning! If Appa comes to Singapore even for a week, he will feel better and can boast to his friends. At over 90, it may be better to move Appa to where I am since many of his friends would be no more and shops, doctors, etc can be managed and getting an independent house nearby is not hard. There is an SEZ in Gujarat where Indians can invest like in Mauritius without forex restraint Shubha: Media sites are moving away from Vickrey auctions to first-price auctions for ads. That’s because they send the auction price forward to a search engine and the winning second-price value can lose even though the owner is willing to pay more. Second-price auctions don’t work unless ALL bidders are in the SAME auction. Ad networks are a hierarchy of auctions! Gemini 1.5 launched. Fly.io offers GPU hosting and auto stop when they have nothing to do. Embeddings in random forest are very effective at classification – much better than dot product. To deploy apps with OAuth + templating support in a small Docker container, use Caddy Deno has native TypeScript, browser APIs, and compiles to multiple OSs Ruff is a MUCH faster flake8 Two pass generation is a clever technique to get multiple SEQUENTIAL answers in a single API request. For example the schema {'code', 'optimized_code'} will generate code and then optimize it. Unions in function calling allows flexible multi-step prompts in a single API.

Things I Learned - 11 Feb 2024

This week, I learned: Dockerfile can have FROM scratch and you can add specific binaries rather than an entire OS. via Fine-tuning session by Dan. Notebook Example of fine-tuning Mistral. Consumed 28 computes ($2.8) Axlotl is what the top fine-tuned LLMs are trained on Deepspeed provides distributed training Flash attention lets data stay on GPU Sample packing packs samples of different lengths into equal length tensors Visualize the RANK of a token in a generated stream instead of logprob The Knowledge Project. Tomorrow Gayner What I’d like in my obituary: Anand was happiness. A guru. Generous. To get what we seek we must deserve this. Build, measure, learn If you did the same thing daily for 50 years, would it be a great thing? If yes, do it. If not, stop. Do this in daily retrospectives My new role should be productivity through technology innovation. That may mean a CTO role. But be specific otherwise no one will understand it Hidden brain podcast. Us 2.0. Win hearts, then minds When in an interaction, ask yourself. Can I learn and change myself? Can I win their hearts, then mines, so their behavior will change. That identity will change Notice when you get emotionally triggered. That’s exactly when you should not get emotionally triggered Try model humility and moral Look for close to people’s identities in our conversations. What are things they like? What does it mean for them? Simply ask. With that understanding of identity, it becomes easier to reframe things in a way they will understand Bard can talk to Gmail and Google Drive! #PREDICTION As automation takes over these mainstream activities, people will take over the niches. Since expertise like knowledge is fractal, there will be many more segments of one in the future and it will be easier to automate clusters of similar abilities. Recommenders and brands will become even more important Stephen Osserman’s Observables have some nice notes. Visualizing partial election results D3 Force Dilemmas: Data Distortion Sandra Becker’s 30 day D3 course