# DB-GPT **Repository Path**: bobosui/DB-GPT ## Basic Information - **Project Name**: DB-GPT - **Description**: DB-GPT 是一个开源的以数据库为基础的 GPT 实验项目,使用本地化的 GPT 大模型与数据和环境进行交互,无数据泄露风险,100% 私密,100% 安全 - **Primary Language**: Python - **License**: MIT - **Default Branch**: main - **Homepage**: https://www.oschina.net/p/db-gpt - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 52 - **Created**: 2023-08-21 - **Last Updated**: 2023-08-21 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # DB-GPT: Revolutionizing Database Interactions with Private LLM Technology

stars forks License: MIT License: MIT Release Notes Open Issues Discord Open in GitHub Codespaces

[**简体中文**](README.zh.md) |[**Discord**](https://discord.gg/Y5rbqj4U) |[**Documents**](https://db-gpt.readthedocs.io/en/latest/)|[**Wechat**](https://github.com/eosphoros-ai/DB-GPT/blob/main/README.zh.md#%E8%81%94%E7%B3%BB%E6%88%91%E4%BB%AC)|[**Community**](https://github.com/eosphoros-ai/community)
## What is DB-GPT? DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure. ## Contents - [install](#install) - [demo](#demo) - [introduction](#introduction) - [features](#features) - [contribution](#contribution) - [acknowledgement](#acknowledgement) - [roadmap](#roadmap) - [contract](#contact-information) [DB-GPT Youtube Video](https://www.youtube.com/watch?v=f5_g0OObZBQ) ## Demo Run on an RTX 4090 GPU. https://github.com/eosphoros-ai/DB-GPT/assets/13723926/55f31781-1d49-4757-b96e-7ef6d3dbcf80 #### Chat with data, and figure charts.

#### Text2SQL, generate SQL from chat

#### Knowledge space to manage docs.

#### Chat with knowledge, such as url, pdf, csv, word. etc

## Features Currently, we have released multiple key features, which are listed below to demonstrate our current capabilities: - SQL language capabilities - SQL generation - SQL diagnosis - Private domain Q&A and data processing - Knowledge Management(We currently support many document formats: txt, pdf, md, html, doc, ppt, and url.) - Database knowledge Q&A - knowledge Embedding - ChatDB - ChatDashboard - Plugins - Support custom plugin execution tasks and natively support the Auto-GPT plugin, such as: - Automatic execution of SQL and retrieval of query results - Automatic crawling and learning of knowledge - Unified vector storage/indexing of knowledge base - Support for unstructured data such as PDF, TXT, Markdown, CSV, DOC, PPT, and WebURL - Multi LLMs Support, Supports multiple large language models, currently supporting - 🔥 Vicuna-v1.5(7b,13b) - 🔥 llama-2(7b,13b,70b) - WizardLM-v1.2(13b) - Vicuna (7b,13b) - ChatGLM-6b (int4,int8) - ChatGLM2-6b (int4,int8) - guanaco(7b,13b,33b) - Gorilla(7b,13b) - baichuan(7b,13b) [![Star History Chart](https://api.star-history.com/svg?repos=csunny/DB-GPT)](https://star-history.com/#csunny/DB-GPT) ## Introduction DB-GPT creates a vast model operating system using [FastChat](https://github.com/lm-sys/FastChat) and offers a large language model powered by [Vicuna](https://huggingface.co/Tribbiani/vicuna-7b). In addition, we provide private domain knowledge base question-answering capability. Furthermore, we also provide support for additional plugins, and our design natively supports the Auto-GPT plugin.Our vision is to make it easier and more convenient to build applications around databases and llm. Is the architecture of the entire DB-GPT shown in the following figure:

The core capabilities mainly consist of the following parts: 1. Knowledge base capability: Supports private domain knowledge base question-answering capability. 2. Large-scale model management capability: Provides a large model operating environment based on FastChat. 3. Unified data vector storage and indexing: Provides a uniform way to store and index various data types. 4. Connection module: Used to connect different modules and data sources to achieve data flow and interaction. 5. Agent and plugins: Provides Agent and plugin mechanisms, allowing users to customize and enhance the system's behavior. 6. Prompt generation and optimization: Automatically generates high-quality prompts and optimizes them to improve system response efficiency. 7. Multi-platform product interface: Supports various client products, such as web, mobile applications, and desktop applications. ### SubModule - [DB-GPT-Hub](https://github.com/eosphoros-ai/DB-GPT-Hub) Text-to-SQL parsing with LLMs - [DB-GPT-Plugins](https://github.com/eosphoros-ai/DB-GPT-Plugins) DB-GPT Plugins, Can run autogpt plugin directly - [DB-GPT-Web](https://github.com/eosphoros-ai/DB-GPT-Web) ChatUI for DB-GPT ## Image 🌐 [AutoDL Image](https://www.codewithgpu.com/i/csunny/DB-GPT/dbgpt-0.3.1-v2) ## Install ![Docker](https://img.shields.io/badge/docker-%230db7ed.svg?style=for-the-badge&logo=docker&logoColor=white) ![Linux](https://img.shields.io/badge/Linux-FCC624?style=for-the-badge&logo=linux&logoColor=black) ![macOS](https://img.shields.io/badge/mac%20os-000000?style=for-the-badge&logo=macos&logoColor=F0F0F0) [**Quickstart**](https://db-gpt.readthedocs.io/en/latest/getting_started/install/deploy/deploy.html) ### Language Switching In the .env configuration file, modify the LANGUAGE parameter to switch to different languages. The default is English (Chinese: zh, English: en, other languages to be added later). ## Usage Instructions If nltk-related errors occur during the use of the knowledge base, you need to install the nltk toolkit. For more details, please refer to: [nltk documents](https://www.nltk.org/data.html) Run the Python interpreter and type the commands: ```bash >>> import nltk >>> nltk.download() ``` ## Acknowledgement This project is standing on the shoulders of giants and is not going to work without the open-source communities. Special thanks to the following projects for their excellent contribution to the AI industry: - [FastChat](https://github.com/lm-sys/FastChat) for providing chat services - [vicuna-13b](https://lmsys.org/blog/2023-03-30-vicuna/) as the base model - [langchain](https://langchain.readthedocs.io/) tool chain - [Auto-GPT](https://github.com/Significant-Gravitas/Auto-GPT) universal plugin template - [Hugging Face](https://huggingface.co/) for big model management - [Chroma](https://github.com/chroma-core/chroma) for vector storage - [Milvus](https://milvus.io/) for distributed vector storage - [ChatGLM](https://github.com/THUDM/ChatGLM-6B) as the base model - [llama_index](https://github.com/jerryjliu/llama_index) for enhancing database-related knowledge using [in-context learning](https://arxiv.org/abs/2301.00234) based on existing knowledge bases. ## Contribution - Please run `black .` before submitting the code. contributing guidelines, [how to contribution](https://github.com/csunny/DB-GPT/blob/main/CONTRIBUTING.md) ## RoadMap

## Licence The MIT License (MIT) ## Contact Information We are working on building a community, if you have any ideas about building the community, feel free to contact us. [![](https://dcbadge.vercel.app/api/server/Y5rbqj4U?compact=true&style=flat)](https://discord.gg/Y5rbqj4U)