Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/add vllm deploy #82

Merged
merged 23 commits into from
Nov 6, 2023
Merged

Feature/add vllm deploy #82

merged 23 commits into from
Nov 6, 2023

Conversation

ruiyiw
Copy link
Collaborator

@ruiyiw ruiyiw commented Oct 30, 2023

Closes #77

πŸ“‘ Description

βœ… Checks

  • My pull request adheres to the code style of this project
  • My code requires changes to the documentation
  • I have updated the documentation as required
  • All the tests have passed
  • Branch name follows type/descript (e.g. feature/add-llm-agents)
  • Ready for code review

β„Ή Additional Information

@ruiyiw ruiyiw merged commit aea8ad6 into main Nov 6, 2023
3 checks passed
lwaekfjlk added a commit that referenced this pull request Nov 16, 2023
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk pushed a commit that referenced this pull request Nov 17, 2023
lwaekfjlk added a commit that referenced this pull request Nov 17, 2023
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
Signed-off-by: Haofei Yu <[email protected]>
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
Signed-off-by: Haofei Yu <[email protected]>
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
(cherry picked from commit 1730759)
lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024
lwaekfjlk added a commit that referenced this pull request Mar 14, 2024
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

* Add fastchat api server tutorial

* Minor modification on the deployment tutorial

---------

Co-authored-by: lwaekfjlk <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEAT]: Deploy fine-tuned models on babel server and provide API
2 participants