This week, I learned:

  • My home WiFi is on WiFi 6. This supports beam-forming which increases range by “focusing” on devices!
  • Predibase lets you run fine-tuned models at the same price, on a per-token basis. 25c/MTok up to 21B models. That’s sames as Claude 3 Haiku, but with fine-tuning.
  • RunPod’s vLLM endpoint lets you run any HuggingFace LLM with an OpenAI API priced on usage (serverless) not on idle time. “Autoscaling to 0”.
  • Portkey is an LLM router