Files
Open-Assistant/docs/docs/intro.md
T
Andrew Maguire d24c127950 Docs site poc (#346)
add docusaurus based docs site under /docs
2023-01-06 22:48:31 +00:00

1.2 KiB

Introduction

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

It can be extended and personalized easily and is developed as free, open-source software.

Our Vision

We want OpenAssistant to be the single, unifying platform that all other systems use to interface with humans.

Principles

  • We put the human in the center
  • We need to get the MVP out fast, while we still have momentum
  • We pull in one direction
  • We are pragmatic
  • We aim for models that can (or could, with some effort) be run on consumer hardware
  • We rapidly validate our ML experiments on a small scale, before going to a supercluster

Main Efforts

  • Data Collection Code → Backend, website, and discord bot to collect data
  • Instruction Dataset Gathering → Scraping & cleaning web data
  • Gamification → Leaderboards & more, to make data collection more fun
  • Model Training → Experiments on pseudo- and real-data
  • Infrastructure → Collection, training, and inference
  • Data Collection → This is the bulk of the work
  • Data Augmentation → Making more data from little data
  • Privacy and Safety → Protecting sensitive data