LLMs (Large Language Models)
#ml/nlp/llm
GitHub - lm-sys/FastChat: The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
GitHub - kernelmachine/cbtm: Code repository for the c-BTM paper
Tokenization
- GitHub - SumanthRH/tokenization: A comprehensive deep dive into the world of tokens
- GitHub - openai/tiktoken: tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Chat
- GitHub - imoneoi/openchat: OpenChat: Advancing Open-source Language Models with Imperfect Data
- LangChain
Alignment
Instruction Tuning
- GitHub - huggingface/alignment-handbook: Robust recipes for to align language models with human and AI preferences
- GitHub - allenai/open-instruct
RLHF
DPO
Training
Fine Tuning
Inference
Serving
Quantization
- GitHub - PanQiWei/AutoGPTQ: An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
- GitHub - IST-DASLab/gptq: Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Tutorials
Visualization
Decoding
Inference Benchmarks
Evaluation
Evaluate LLMs and RAG a practical example using Langchain and Hugging Face