GitHub - KUNALSINGH9373/Large-Language-Models-Papers: This repo contains all influential research papers and blogs related to LLMs.

Large Language Models (LLMs) are a class of AI models that combine very large text datasets with deep learning architectures to perform a wide range of natural language processing (NLP) tasks. They can generate fluent text, translate between languages, write creative content, and answer questions informatively.

Understanding LLMs:
Core Principle: LLMs learn by analyzing massive amounts of text data, identifying patterns and relationships between words and sentences. This training allows them to predict the next word in a sequence or generate new text that is statistically similar to their training data.
Key Component: Transformers, a deep learning architecture that excels at modeling relationships between words in a sentence regardless of their position. This enables LLMs to use context and generate coherent text.
Capabilities: LLMs can accomplish various tasks depending on their training data and architecture. Some examples:
Generative tasks: writing creative content such as poems, code, scripts, musical pieces, emails, and letters.
Informative tasks: answering questions, summarizing documents, and translating languages.

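To make the "relationships between words regardless of their position" idea concrete, here is a minimal sketch of scaled dot-product attention, the core operation inside a Transformer layer. It is written in plain Python for readability. The 2-dimensional `embeddings` are made-up values, and the sketch assumes queries, keys, and values are the embeddings themselves; real models apply learned projection matrices first and use many attention heads.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Score this query against every key, wherever that key sits in the sentence.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        # Each output is a weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical embeddings for a three-word sentence (illustrative values only).
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(embeddings, embeddings, embeddings)
```

Each row of `weights` describes how strongly one word attends to every other word. Because the score depends only on the vectors and not on the distance between positions, the first word attends to the third as strongly as it attends to itself in this example.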
Training LLMs:
Dataset Size: LLMs are trained on very large datasets of text and code, often containing billions of words or more. This volume of data provides the vocabulary and diverse examples the model needs to learn effectively.
Pre-training and Fine-tuning: Training typically involves two stages: pre-training on a large general-purpose corpus (for example, one that includes Wikipedia), then fine-tuning on a smaller task-specific dataset. This two-step process lets the model learn general language skills first and then specialize in a particular task or domain.
Challenges: Training LLMs requires significant computational resources and expertise. In addition, biases present in the training data can be reflected in the model's outputs, which calls for careful data curation and bias mitigation techniques.

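The two-stage idea can be illustrated with a deliberately tiny stand-in for a language model: a table of word-pair counts that predicts the most frequent next word. This is only a toy analogy under stated assumptions. Real LLMs are neural networks trained by gradient descent, and both corpora below are invented for the example. The point it shows is that continuing training on domain text shifts the predictions of a model that already learned from general text.

```python
from collections import Counter, defaultdict

def train(counts, text):
    """Update next-word counts in place from whitespace-tokenized text."""
    words = text.lower().split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Stage 1: "pre-train" on a hypothetical general-purpose corpus.
general_corpus = "the model reads the book . the model reads the news ."
counts = defaultdict(Counter)
train(counts, general_corpus)
before = predict_next(counts, "the")

# Stage 2: "fine-tune" by continuing training on hypothetical domain text.
domain_corpus = (
    "the patient sees the doctor . "
    "the patient needs the patient record . "
    "the patient rests ."
)
train(counts, domain_corpus)
after = predict_next(counts, "the")

print(before, "->", after)
```

After the second stage the model still remembers what it learned in the first (for instance, what follows "model"), but its most likely continuation of "the" now reflects the domain data.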
Impact and Applications:
Revolutionizing NLP: LLMs are transforming the field of NLP, enabling more natural and interactive human-computer interaction in many contexts.
Creative Applications: LLMs can be used for creative writing, code generation, and other artistic endeavors, pushing the boundaries of human-machine collaboration.
Real-world Applications: LLMs have potential applications in areas such as customer service, education, and journalism, where they can automate tasks and improve access to information.

Ethical Considerations:
Bias and Fairness: LLMs trained on biased data can perpetuate harmful stereotypes and discriminatory practices. Addressing bias through careful data selection and model development is crucial.
Misinformation and Explainability: The ability of LLMs to generate realistic text raises concerns about misinformation and creates a need for transparency in model outputs and decision-making processes.
Accessibility and Openness: Access to LLMs and the data they use should be democratized to avoid exacerbating existing inequalities and to encourage responsible development and application.

In conclusion, LLMs represent a significant step forward in the field of AI, opening up exciting possibilities for how we interact with technology and use language. However, it's important to acknowledge the challenges and ethical considerations associated with these powerful models and ensure their development and deployment are mindful of their potential impact on our world.

Blogs and other information about LLMs

The Inner Workings of LLMs: A Deep Dive into Language Model Architecture: https://www.analyticsvidhya.com/blog/2023/07/inner-workings-of-llms/
A Comprehensive Guide to Fine-Tuning Large Language Models: https://www.analyticsvidhya.com/blog/2023/08/fine-tuning-large-language-models/#h-the-need-for-fine-tuning-llms
Transfer Learning from Large Language Models (LLMs): https://maddevs.io/blog/transfer-learning-from-large-language-models/

Folders and files (6 commits)

A Comprehensive Overview of Large Language Models.pdf
A Comprehensive Survey on Transfer Learning.pdf
A Decade Survey of Transfer Learning (2010–2020).pdf
A General Language Assistant as aLaboratory for ALignment.pdf
A Survey of Large Language Models.pdf
A Survey on Transfer Learning.pdf
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language model.pdf
Context Tuning for Retrieval Augmented Generation.pdf
FAIRTUNE- OPTIMIZING PARAMETER EFFICIENT FINE FOR FAIRNESS IN MEDICAL IMAGING.pdf
Failure Modes of Learning Reward Models.pdf
Fine-Tuning Language Models from Human Preferences.pdf
Fine-Tuning Pretrained Language Models- Weight Initializatioins, Data Orders and Early Stopping.pdf
Fine-tuning Language Models for Factuality.pdf
Fine-tuning language models to find agreement among humans with diverse preferences.pdf
IMPROVING LARGE LANGUAGE MODEL FINE-TUNING FOR SOLVING MATH PROBLEMS.pdf
INSTRUCTION TUNING LARGE LANGUAGE MODEL ON REGION OF INTEREST.pdf
INTRINSIC DIMENSIONALITY EXPLAINS THE EFFECTIVENESS OF LANGUAGE MODEL FINE-TUNING.pdf
Instruction Tuning for Large Language Models A Survey.pdf
LORA-- LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS.pdf
Learning Reward for Physical Skills using Large Language Model.pdf
Learning to summarize from human feedback.pdf
Learning_the_Reward_Model_of_Dialogue_POMDPs_from_data.pdf
On the Effectiveness of Parameter-Efficient Fine-Tuning.pdf
PAIRWISE PROXIMAL POLICY OPTIMIZATION-- Harnessing relative feedback for LLM alignment.pdf
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Model.pdf
Parameter-Efficient Transfer Learning for NLP.pdf
Prefix-Tuning- Optimizing Continuous Prompts for Generation.pdf
Proximal Policy Optimization Algorithms.pdf
README.md
REWARD DESIGN WITH LANGUAGE MODELS.pdf
Retrieval-Augmented Generation for Knowledge Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Large Language Models-- A Survey.pdf
Revisiting Parameter-Efficient Tuning-Are We Really There Yet.pdf
STANDING ON THE SHOULDERS OF GIANT FROZEN LANGUAGE MODELS.pdf
Scalable agent alignment via reward modeling-- A research direction.pdf
Scaling Laws for Reward Model Overoptimization.pdf
Scaling laws for LLMs.pdf
Secrets of RLHF in Large Language Models.pdf
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics.pdf
Small Pre-trained Language Models Can be Fine-tuned as Large Models via Overparameterization.pdf
Talking About Large Language Models.pdf
The Power of Scale for Parameter-Efficient Prompt Tuning.pdf
Towards Better Parameter-Efficient Fine-Tuning for Large Language Models.pdf
Training a Helpful and Harmless Assistant with RLHF.pdf
Transfer Learning Toolkit--Primers and Benchmarks.pdf
Truly Proximal Policy Optimization.pdf
Trust Region Policy Optimization.pdf
Tuning Large language model for End-to-end Speech Translation.pdf
WebGPT-- Browser-assisted question-answering with human feedback.pdf
Your Language Model is Secretly a Reward Model.pdf
