journal2024-08-12On this page2024-08-12 Is ChatGPT an N-gram model on steroids? - YouTube How to Prune and Distill Llama-3.1 8B to an NVIDIA Llama-3.1-Minitron 4B Model | NVIDIA Technical Blog [2407.14679] Compact Language Models via Pruning and Knowledge Distillation [2408.06643] BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical Search Small but Mighty: Introducing answerai-colbert-small – Answer.AI2024-08-18 - Sunday Ask HN: What do you monitor on your servers? | Hacker News #monitoring GitHub - duckdb/pg_duckdb: DuckDB-powered Postgres for high performance apps & analytics. #duckdb mlx-examples/whisper at main · ml-explore/mlx-examples · GitHub