This week, I learned:
- Speech editing in audio files is a thing. Speech Editing Toolkit and Descript
- GPT 4o Mini is almost as good as GPT 4o in the LMSYS leaderboard. Llama 3.1 400B model and Mistral 2 Large are yet to be evaluated.
- If LLMs can generate any text, and text can describe the real world, we can rapidly generate “artifacts” that generate:
- 3D Printable Models:
- STL (Stereolithography): Defines the surface geometry of 3D objects using triangular facets.
- OBJ (Wavefront OBJ): Describes 3D geometry including vertices, textures, and normals.
- X3D: An XML-based file format for representing 3D computer graphics.
- Vector Graphics:
- SVG (Scalable Vector Graphics): Defines vector-based graphics in XML format, useful for illustrations, diagrams, and user interface elements.
- CAD Drawings:
- DXF (Drawing Exchange Format): Represents CAD data, including shapes, lines, and curves, used in engineering and architecture.
- Circuit Designs:
- KiCAD: An open-source software suite for Electronic Design Automation (EDA), which uses various file formats like PCBNew and EESchema to represent circuit designs.
- Blueprints and Architectural Designs:
- GML (Geography Markup Language): Encodes geographical features and spatial information.
- CityGML: A specific GML application schema for modeling and exchanging 3D city models.
- Molecular Structures:
- PDB (Protein Data Bank): Describes the three-dimensional structures of molecules.
- CML (Chemical Markup Language): An XML-based standard for representing molecular data.
- Robotics and Automation:
- URDF (Unified Robot Description Format): Defines the physical configuration of a robot, including joints, links, and sensors.
- COLLADA (Collaborative Design Activity): An XML-based schema to describe digital assets for 3D applications, often used in robotics.
- Geospatial Data:
- KML (Keyhole Markup Language): Used for geographic data visualization, primarily in Google Earth.
- GeoJSON: A format for encoding a variety of geographic data structures using JSON.
- Mathematical Markup:
- MathML (Mathematical Markup Language): Describes mathematical notation and captures both its structure and content.
- Music and Sound:
- MusicXML: Encodes sheet music in a structured format that can be easily shared between different music notation software.
- Documents and Text:
- DocBook: A semantic markup language for technical documentation.
- Markdown: A lightweight markup language with plain text formatting syntax.
- Biological Data:
- SBML (Systems Biology Markup Language): Represents computational models of biological processes.
- PhyloXML: An XML format for representing phylogenetic trees.
- Game Development:
- FBX (Filmbox): A file format for 3D animation that can hold information about the geometry, textures, and animations.
- VRML (Virtual Reality Modeling Language): Describes interactive 3D objects and worlds.
- Data Visualization:
- ChartML: Encodes charts and graphs in a structured format.
- D3.js (Data-Driven Documents): Uses HTML, SVG, and CSS to bring data to life with interactive visualizations.
- Building Information Modeling (BIM):
- IFC (Industry Foundation Classes): Describes building and construction data.
- Textiles and Fabrics:
- LoomML: Represents the design and structure of woven fabrics.
- Augmented Reality and Virtual Reality:
- ARML (Augmented Reality Markup Language): Defines how augmented reality applications should behave and what content they should display.
- VRML (Virtual Reality Modeling Language): For describing interactive 3D objects and worlds.
- Medical Imaging and Health Data:
- DICOM (Digital Imaging and Communications in Medicine): Encodes medical imaging data.
- HL7 (Health Level 7): A set of standards for the exchange of information between medical applications.
- Simulation Data:
- FMI (Functional Mock-up Interface): Represents and exchanges dynamic simulation models.
- SBML (Systems Biology Markup Language): For computational models of biological processes.
- Sound and Audio:
- MML (Music Markup Language): For encoding music notation and performance information.
- SoundFont: A file format for defining musical instrument sounds.
- Animation and Visual Effects:
- BVH (Biovision Hierarchy): Encodes motion capture data.
- Alembic: A computer graphics interchange framework primarily for exchanging animation and visual effects data.
- Textile Patterns:
- WIF (Weaving Information File): Describes weaving patterns and structures.
- Knitting Markup Language: Encodes knitting patterns in a structured format.
- Scientific Data:
- CDF (Common Data Format): Used for storing scientific data.
- NetCDF (Network Common Data Form): Supports the creation, access, and sharing of array-oriented scientific data.
- Photography and Imaging:
- XMP (Extensible Metadata Platform): Used for embedding metadata in digital images and other media files.
- Construction and Engineering:
- LandXML: For civil engineering and land surveying data.
- gbXML (Green Building XML): Facilitates the transfer of building data for analysis of energy and environmental performance.
- Packaging and Retail:
- BPL (Barcode Product Labeling): Encodes information for product packaging and labeling.
- GS1 XML: Used for electronic business messaging, including product identification and tracking.
- Typography and Font Design:
- UFO (Unified Font Object): A format for storing font data.
- SFNT (Spline Font): Encodes scalable font information.
- Product Data Management:
- PLMXML (Product Lifecycle Management XML): Used for sharing product data across PLM systems.
- 3D Printable Models:
- GPT 4o Mini can be fine-tuned!
- Awesome PaaS lists self-hosted deployment platforms. Piku - similar to Dokku – is promising.