Awesome LLMOps

What is LLMOps?

LLMOps is a part of MLOps practices, specialized form of MLOps that focuses on managing the entire lifecycle of large language models(LLM).

Starting in 2021, as LLMs evolved rapidly and the technology matured, we began to focus on practices for managing LLMs efficiently, and LLMOps, which are adaptations of traditional MLOps practices to LLMs, began to be talked about.

LLMOps vs MLOps

	LLMOps	MLOps
Definition	Tools and infrastructure specifically for the development and deployment of large language models	Tools and infrastructure for general machine learning workflows
Focus	Unique requirements and challenges of large language models	General machine learning workflows
Key technologies	Language model, Transformers library, human-in-the-loop annotation platforms	Kubeflow, MLflow, TensorFlow Extended
Key skills	NLP expertise, knowledge of large language models, data management for text data	Data engineering, DevOps, Software engineering, Machine learning expertise
Key challenges	Managing and labeling large amounts of text data, fine-tuning foundation models for specific tasks, ensuring fairness and ethics in language models	Managing complex data pipelines, ensuring model interpretability and explainability, addressing model bias and fairness
Industry adoption	Emerging, with a growing number of startups and companies focusing on LLMOps	Established, with a large ecosystem of tools and frameworks available
Future outlook	LLMOps is expected to become an increasingly important area of study as large language models become more prevalent and powerful	MLOps will continue to be a critical component of the machine learning industry, with a focus on improving efficiency, scalability, and model reliability

⬆️ Go to top

Prompt Engineering

PromptBase - Marketplace of the prompt engineering
PromptHero - The website for prompt engineering
Prompt Search - The search engine for the prompt engineering
Prompt Perfect - Auto Prompt Optimizer
Learn Prompting - The tutorial website for the prompt engineering
Blog: Exploring Prompt Injection Attacks
Blog: Prompt Leaking
Paper: Prefix-Tuning: Optimizing Continuous Prompts for Generation

⬆️ Go to top

Models

Name	Parameter Size	Announcement Date	Provider
GPT-4.5	Undisclosed	Feburary 2025	OpenAI
Grok‑3	Undisclosed	Feburary 2025	xAI
Gemini-2	Undisclosed	Feburary 2025	Google
DeepSeek-VL2	4.5B	Feburary 2025	DeepSeek
DeepSeek-R1	671B	January 2025	DeepSeek
DeepSeek-V3	671B	December 2024	DeepSeek
GPT‑o1	Undisclosed	September 2024	OpenAI
Qwen-2.5	0.5B, 1.5B, 3B, 7B, 14B, 72B	September 2024	Alibaba Cloud
Gemma-2	2B, 9B, 27B	June 2024	Google
Qwen-2	0.5B, 1.5B, 7B, 57B, 72B	June 2024	Alibaba Cloud
GPT‑4o	Undisclosed	May 2024	OpenAI
Yi‑1.5	6B, 9B, 34B	May 2024	01.AI
DeepSeek-V2	238B (21B active)	April 2024	DeepSeek
Llama-3	8B, 70B	April 2024	Meta
Gemma-1.1	2B, 7B	April 2024	Google
DeepSeek-VL	7B	March 2024	DeepSeek
Claude-3	Undisclosed	March 2024	Anthropic
Grok‑1	314B	March 2024	xAI
DBRX	132B (36B active)	March 2024	Databricks
Gemma	2B, 7B	February 2024	Google
Qwen-1.5	0.5B, 1.8B, 4B, 7B, 14B, 72B	February 2024	Alibaba Cloud
Qwen‑VL	Undisclosed	January 2024	Alibaba Cloud
Phi‑2	2.7B	December 2023	Microsoft
Gemini	Undisclosed	December 2023	Google
Mixtral	46.7B	December 2023	Mistral AI
Grok‑0	33B	November 2023	xAI
Yi	6B, 34B	November 2023	01.AI
Zephyr‑7b‑beta	7B	October 2023	HuggingFace H4
Solar	10.7B	September 2023	Upstage
Mistral	7.3B	September 2023	Mistral AI
Qwen	1.8B, 7B, 14B, 72B	August 2023	Alibaba Cloud
Llama-2	7B, 13B, 70B	July 2023	Meta
XGen	7B	July 2023	Salesforce
Falcon	7B, 40B, 180B	June/Sept 2023	Technology Innovation Institute (UAE)
MPT	7B, 30B	May/June 2023	MosaicML
LIMA	65B	May 2023	Meta AI
PaLM-2	340B	May 2023	Google
Vicuna	7B, 13B, 33B	March 2023	LMSYS ORG
Koala	13B	April 2023	UC Berkeley
OpenAssistant	30B	April 2023	LAION
Jurassic‑2	Undisclosed	April 2023	AI21 Labs
Dolly	6B, 12B	March/April 2023	Databricks
BloombergGPT	50B	March 2023	Bloomberg
GPT‑4	Undisclosed	March 2023	OpenAI
Bard	Undisclosed	March 2023	Google
Stanford-Alpaca	7B	March 2023	Stanford University
LLaMA	7B, 13B, 33B, 65B	February 2023	Meta
ChatGPT	Undisclosed	November 2022	OpenAI
GPT‑3.5	175B	November 2022	OpenAI
Jurassic‑1	178B	November 2022	AI21
Galactica	120B	November 2022	Meta
Sparrow	70B	September 2022	DeepMind
NLLB	54.5B	July 2022	Meta
BLOOM	176B	July 2022	BigScience (Hugging Face)
AlexaTM	20B	August 2022	Amazon
UL2	20B	May 2022	Google
OPT	175B	May 2022	Meta (Facebook)
PaLM	540B	April 2022	Google
AlphaCode	41.4B	February 2022	DeepMind
Chinchilla	70B	March 2022	DeepMind
GLaM	1.2T	December 2021	Google
Macaw	11B	October 2021	Allen Institute for AI
T0	11B	October 2021	Hugging Face
Megatron‑Turing-NLG	530B	January 2022	Microsoft & NVIDIA
LaMDA	137B	January 2022	Google
Gopher	280B	December 2021	DeepMind
GPT‑J	6B	June 2021	EleutherAI
GPT‑NeoX-2.0	20B	February 2022	EleutherAI
T5	60M, 220M, 770M, 3B, 11B	October 2019	Google
BERT	108M, 334M, 1.27B	October 2018	Google

⬆️ Go to top

Optimization

⬆️ Go to top

Tools (GitHub)

Stanford Alpaca - - A repository of Stanford Alpaca project, a model fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations.
LoRA - - An implementation of "LoRA: Low-Rank Adaptation of Large Language Models".
Dolly - - A large language model trained on the Databricks Machine Learning Platform.
DeepSpeed - - A deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
LMFlow - - An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.
Helicone AI - - Open-source LLM observability platform for logging, monitoring, and debugging AI applications. Simple 1-line integration to get started.
Promptify - - An utility / tookit for Prompt engineering.
Auto-GPT - - An experimental open-source attempt to make GPT-4 fully autonomous.
Jarvis - - A system to connect LLMs with ML community, a composite model connector via the LLM interface.
dalai - - The cli tool to run LLaMA on the local machine.
haystack - -an open source NLP framework to interact with the data using Transformer models and LLMs.
langchain - - The library which assists in the development of applications with LLM.
langflow - - An UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
deeplake - - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets.
alpaca-lora - - Instruct-tune LLaMA on consumer hardware.
bosquet - - LLMOps for Large Language Model based applications.
llama_index - - A project that provides a central interface to connect your LLM's with external data.
gradio - - An UI helper for the machine learning model.
sharegpt - - An open-source Chrome Extension for you to share your wildest ChatGPT conversations with one click.
Starwhale - - An MLOps/LLMOps platform for model building, evaluation, and fine-tuning.
keras-nlp - - A natural language processing library that supports users through their entire development cycle.
Snowkel AI - - The data platform for foundation models.
promptflow - - A toolkit that simplifies the development of LLM-based AI applications, from ideation to deployment.

⬆️ Go to top

Tools (Other)

PaLM2 API - An API service that makes PaLM2, Large Language Models (LLMs), available to Google Cloud Vertex AI.
Perspective API - A tool that can help mitigate toxicity and ensure healthy dialogue online.
LangSmith - A monitoring and debugging platform by the LangChain team that provides systematic performance tracking, error analysis, and logging for LLM-based applications.
OpenLLM (by BentoML) - A deployment tool from BentoML that simplifies serving various large language models in production environments.
PromptLayer - A tool for tracking and analyzing prompt engineering experiments, helping optimize prompt performance and outcomes.

⬆️ Go to top

RLHF

evals - - A curated list of reinforcement learning with human feedback resources.
trlx - - A repo for distributed training of language models with Reinforcement Learning via Human Feedback. (RLHF)
PaLM-rlhf-pytorch - - Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture.

⬆️ Go to top

Awesome

⬆️ Go to top

Contributing

We welcome contributions to the Awesome LLMOps list! If you'd like to suggest an addition or make a correction, please follow these guidelines:

Fork the repository and create a new branch for your contribution.
Make your changes to the README.md file.
Ensure that your contribution is relevant to the topic of LLMOps.
Use the following format to add your contribution:

[Name of Resource](Link to Resource) - Description of resource

Add your contribution in alphabetical order within its category.
Make sure that your contribution is not already listed.
Provide a brief description of the resource and explain why it is relevant to LLMOps.
Create a pull request with a clear title and description of your changes.

We appreciate your contributions and thank you for helping to make the Awesome LLMOps list even more awesome!

⬆️ Go to top

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
LICENSE		LICENSE
README.md		README.md
cover.png		cover.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome LLMOps

Table of Contents

What is LLMOps?

LLMOps vs MLOps

Prompt Engineering

Models

Optimization

Tools (GitHub)

Tools (Other)

RLHF

Awesome

Contributing

About

Releases

Packages

Contributors 3

License

KennethanCeyer/awesome-llmops

Folders and files

Latest commit

History

Repository files navigation

Awesome LLMOps

Table of Contents

What is LLMOps?

LLMOps vs MLOps

Prompt Engineering

Models

Optimization

Tools (GitHub)

Tools (Other)

RLHF

Awesome

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Packages