Welcome to my "Today I Learned" (TIL) repository! This is a personal space where I document and share concise notes on topics I come across and learn day to day. The goal is to create easily digestible snippets that can be quickly reviewed and understood. Think of it as a public journal of what I learn in my free time.
The list of entries is updated as I add more TILs.
This repository is structured by topic categories. Each "TIL" entry should be a short, focused explanation of a single concept, fact, or skill I've learned. I aim for clarity and conciseness, making each entry a quick and valuable read.
- Using Gemini 2.0 Flash for High-Quality Audio Transcription and Analysis: Using the Gemini 2.0 Flash Thinking Experimental 01-21 model for high-quality audio transcription and analysis of Bilibili videos.
- Latent Space Revolution: A Deep Dive into VAE Architectures and Performance Comparison of Flux.1 and Stable Diffusion: In-depth analysis and performance comparison of the VAEs in Flux.1 and Stable Diffusion, along with other related autoencoders.
- Rerun and Foxglove: Emerging Data Visualization Platforms for Robotics: Introducing Rerun and Foxglove, two emerging data visualization platforms for robotics, and comparing them with RViz and Unity.
- Fusion and Evolution: How VLMs and World Models are Reshaping Autonomous Driving Technology: In-depth analysis of VLM-E2E, Doe-1, and DriveVLM in the context of autonomous driving, and a comparison of VLM, VLA, and world model architectures.
While this is primarily a personal learning log, constructive feedback and suggestions are welcome! If you spot any errors or have ideas for improvement, feel free to open an issue.
This project is licensed under the MIT License. See the LICENSE file for details.