- [2023 arxiv] Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond [paper]
- [2023 arxiv] A Survey of Large Language Models [paper]
- [2023 arxiv] One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era [paper]
- [2023 arxiv] (Meta) LLaMA: Open and Efficient Foundation Language Models [paper][code]
- [2023 arxiv] (OpenAI) GPT-4 Technical Report [paper]
- [2023 Github] (Stability-AI) StableLM: Stability AI Language Models [code]
- [2023 Github] LlamaHub [code] (NOTE: a simple library of all the data loaders)
- [2023 Github] LlamaIndex [code] (NOTE: a central interface to connect your LLMs with external data)
- [2023 arxiv] LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention [paper][code]
- [2023 Github] Stanford Alpaca: An Instruction-following LLaMA model [code]
- [2023 Github] Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90% ChatGPT Quality [code]
- [2023 Github] Llama-X: Open Academic Research on Improving LLaMA to SOTA LLM [code]
- [2023 Github] DeepSpeed Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales [code]
- [2023 Github] LangChain: Building applications with LLMs through composability [code]
- [2023 Github] Auto-GPT: An Autonomous GPT-4 Experiment [code]
- [2023 arxiv] HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace [paper][code]
- [2023 arxiv] ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge [paper][code]
- [2023 arxiv] BloombergGPT: A Large Language Model for Finance [paper]
- [2023 bioRxiv] The Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics [paper][code]
- [2023 arxiv] ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks [paper]
- [2023 arxiv] Language Models can Solve Computer Tasks [paper]
- [2022 arxiv] BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining [paper][code]
- [2023 Github] Dolly [code]