Skip to content

Kiln-AI/Kiln

Repository files navigation

Kiln AI Logo

CI Build and Test Format and Lint Desktop Apps Build Web UI Build Test Count Badge Test Coverage Badge Docs
Package PyPI - Version PyPI - Python Version
Meta uv linting - Ruff Hatch project types - Pyright Docs
Apps MacOS Windows Linux

Kiln AI

Build High Quality AI Products with Datasets

Key Features

  • 🚀 Intuitive Desktop Apps: Apps for Windows and MacOS. Anyone on your team can contribute to your datasets & feedback to improve quality, without code.
  • 🤝 Team Collaboration: Designed for collaboration, with git-based version control for your AI dataset.
  • 📊 Rapid Evaluation: Compare various models and data-science techniques with no-code tools.
  • 🤖 Synthetic Data Generation: Generate data with our no-code visual tooling.
  • 🔄 Iterative Improvement: Continuous dataset improvement with built-in evaluation, RLHF and QA tooling.
  • 🧑‍💻 Open-Source Library and API: Our python library and OpenAPI REST API are MIT open source.
  • 🔒 Privacy-First: We can't see your data. Bring your own API keys or run locally with Ollama.
  • 🗃️ Structured Data: Build AI tasks which speak JSON. Build training datasets with validated formats, human feedback, issue tracking, and more.
  • 💰 Free: Our apps are free and our library is open-source.

Download Kiln Desktop Apps

The Kiln desktop app is a free download, available on MacOS and Windows. We have experimental Linux support.

Download button

Preview of App

Install Python Library

PyPI - Version Docs

Our open-source python library allows for deep custom integrations. Read the docs.

pip install kiln-ai

Learn More

Build High Quality AI Products with Datasets

Products don’t naturally have “datasets”, but Kiln helps you create one.

Every time you use Kiln, we capture the inputs, outputs, human ratings, feedback, and repairs needed to build high quality models for use in your product. The more you use it, the more data you have.

Your model quality improves automatically as the dataset grows, by giving the models more examples of quality content (and mistakes).

If your product goals shift or new bugs are found (as is almost always the case), you can easily iterate the dataset to address issues.

Collaborate Across Technical and Non-Technical Teams

When building AI products, there’s usually a subject matter expert who knows the problem you are trying to solve, and a different technical team assigned to build the model. Kiln bridges that gap as a collaboration tool.

Subject matter experts can use our easy to use desktop apps to generate structured datasets and ratings, without coding or using technical tools. No command line or GPU required.

Data-scientists can consume the dataset created by subject matter experts, using the UI, or dive deep with our python library.

QA and PM can easily identify issues sooner and help generate the dataset content needed to fix the issue at the model layer.

The dataset file format is designed to be be used with Git for powerful collaboration and attribution. Many people can contribute in parallel; collisions are avoided using UUIDs, and attribution is captured inside the dataset files. You can even share a dataset on a shared drive, letting completely non-technical team members contribute data and evals without knowing Git.

Compare Models and Techniques Without Code

There are new models and techniques emerging all the time. Kiln makes it easy to try a variety of approaches, and compare them in a few clicks, without writing code. These can result in higher quality, or improved performance (smaller/cheaper/faster models at the same quality).

Our current beta supports:

  • Various prompting techniques: basic, few-shot, multi-shot, repair & feedback
  • Many models: GPT, Llama, Claude, Gemini, Mistral, Gemma, Phi
  • Chain of thought prompting, with optional custom “thinking” instructions

In the future, we plan to add more powerful no-code options like fine tuning, lora, evals, and RAG. For experienced data-scientists, you can create these techniques today using Kiln datasets and our python library.

Structured Data

We prioritize data correctness, which makes integrating into AI products easier. No data gets into the dataset without first passing validation, which keeps the dataset clean.

Our easy to use schema UI lets you create and use structured schemas, without knowing JSON-schema formating. For technical users, we support any valid JSON-schema for inputs and outputs.

Powerful Open-source Library and REST API

We have an open-source python library, and an open-source REST API. You can easily integrate Kiln datasets into your own workflows, build fine tunes, use Kiln in Notebooks, build custom tools, and much more!

The REST API supports OpenAPI, so you can generate client libraries for almost any language.

Privacy

Your data stays completely private and local to your machine. We never collect or have access to:

  • Datasets / Training Data
  • API keys
  • Model inputs/outputs (runs)

You can run completely locally using Ollama, or bring your own keys for OpenAI, OpenRouter, Groq, AWS, etc.

Note: We collect anonymous usage metrics via Posthog analytics (never including dataset content or PII). This can be blocked with standard ad-blockers.

Contributing & Development

See CONTRIBUTING.md for information on how to setup a development environment and contribute to Kiln.

Licenses & Trademarks

Copyright 2024 - Chesterfield Laboratories Inc.