
Integration of generic LLM API (e.g. Xinference) #24

Merged · 42 commits · Nov 23, 2023

Conversation

AndiMajore
Contributor

Adds a generic implementation of the Conversation class to allow the use of non-OpenAI endpoints that follow the OpenAI API style.
Includes a customized copy of openai.py as generic_openai.py:

  • to disable model-name-based tokenization
  • to override the model-name-based URL selection, so that a static endpoint URL can be used with the model_uid passed in the request for model selection
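The two changes above can be sketched roughly as follows. This is a hypothetical illustration, not the PR's actual code: the class name `GenericOpenAIConversation`, the method names, and the `KNOWN_OPENAI_MODELS` set are all placeholders for the pattern of keeping a static endpoint URL and selecting the model via the request payload.

```python
# Illustrative sketch only; names and structure are assumptions, not the
# repository's real generic_openai.py.

KNOWN_OPENAI_MODELS = {"gpt-3.5-turbo", "gpt-4"}  # assumed rough stand-in

class GenericOpenAIConversation:
    def __init__(self, base_url: str, model_uid: str):
        # Static endpoint: the URL is NOT derived from the model name.
        self.base_url = base_url
        self.model_uid = model_uid

    def counts_tokens(self) -> bool:
        # Disable model-name-based (tiktoken-style) token counting for
        # models that are not known OpenAI models.
        return self.model_uid in KNOWN_OPENAI_MODELS

    def build_request(self, messages: list[dict]) -> dict:
        # The model is selected via the request body (model_uid),
        # not via the URL.
        return {
            "url": f"{self.base_url}/v1/chat/completions",
            "json": {"model": self.model_uid, "messages": messages},
        }

conv = GenericOpenAIConversation("http://localhost:9997", "my-llama-uid")
req = conv.build_request([{"role": "user", "content": "Hello"}])
```

With a non-OpenAI `model_uid`, `conv.counts_tokens()` returns `False` and the request URL stays fixed at the static endpoint.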

@slobentanzer
Contributor

slobentanzer commented Nov 3, 2023

Thanks for the PR, @AndiMajore!

@fengsh27, this is the local LLM application that we briefly talked about. I think it would be helpful for you to see the implementation, and helpful for the PR if you could give your opinion / input on the code.

In particular, I would like to reduce redundancies between the newly established generic OpenAI connectivity and the one that previously existed. It would be great if the existing GPTConversation could inherit from the generic one with some fixed parameters, and if the tests reflected this as well.

I will be back to being able to code next week, so I'm happy to get practically involved as well. @AndiMajore, could you summarise again what kind of conflicts we have between the OpenAI API and the custom Xinference one?

Ideally, we would end up with a generic OpenAIConversation that can have Xinference and official OpenAI API (regular and Azure) children, with minimal code duplication. So the hierarchy would be:

Conversation
- OpenAIConversation ( + HuggingFaceConversation etc )
-- XinferenceEndpointOpenAIConversation, OfficialEndpointOpenAIConversation, AzureEndpointOpenAIConversation

Maybe we can come up with more concise names for the classes. ;)
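The proposed hierarchy could be sketched like this. All class names and attributes below are placeholders taken from the discussion above, not code that exists in the repository; the base URLs are illustrative defaults.

```python
# Illustrative sketch of the proposed class hierarchy; not actual repo code.

class Conversation:
    """Shared chat-history handling for all backends."""

class OpenAIConversation(Conversation):
    """Generic OpenAI-style endpoint logic (request building, etc.)."""
    def __init__(self, base_url: str, model: str):
        self.base_url = base_url
        self.model = model

class XinferenceEndpointOpenAIConversation(OpenAIConversation):
    """Fixes a static Xinference URL; the model is picked via model_uid."""
    def __init__(self, model_uid: str):
        # localhost:9997 is Xinference's default port, used here as an example
        super().__init__(base_url="http://localhost:9997", model=model_uid)

class OfficialEndpointOpenAIConversation(OpenAIConversation):
    """Fixes the official OpenAI endpoint."""
    def __init__(self, model: str):
        super().__init__(base_url="https://api.openai.com", model=model)
```

The point of the structure is that the children only pin parameters (endpoint URL, model selection mechanism) while all request logic lives once in the generic parent.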

@AndiMajore
Contributor Author

Hey @slobentanzer,
I just changed the Xinference chat integration to use Xinference's Client class, and the Xinference embedding integration to use LangChain's XinferenceEmbeddings class.
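The call pattern described here might look roughly like the following. In the real integration the client would be `xinference.client.Client` (looked up via `get_model(model_uid)`) and the embedder LangChain's `XinferenceEmbeddings`; both are replaced by stubs below so the sketch runs standalone, and none of this is the PR's actual code.

```python
# Illustrative call pattern only. StubClient/StubModel/StubEmbeddings stand
# in for xinference.client.Client, its models, and langchain's
# XinferenceEmbeddings, so the sketch has no external dependencies.

class StubModel:
    def chat(self, prompt: str) -> dict:
        # Mimics an OpenAI-style chat completion response.
        return {"choices": [{"message": {"content": f"echo: {prompt}"}}]}

class StubClient:
    def __init__(self, server_url: str):
        self.server_url = server_url

    def get_model(self, model_uid: str) -> StubModel:
        # The real client resolves the model on the server by its uid.
        return StubModel()

# Chat integration: connect once, look the model up by uid, then chat.
client = StubClient("http://localhost:9997")
model = client.get_model("my-llama-uid")
reply = model.chat("Hello")["choices"][0]["message"]["content"]

class StubEmbeddings:
    """Stands in for XinferenceEmbeddings(server_url=..., model_uid=...)."""
    def embed_query(self, text: str) -> list[float]:
        return [float(len(text))]  # placeholder vector

vector = StubEmbeddings().embed_query("Hello")
```

The design choice here is that both chat and embeddings address the server by a `model_uid` rather than by a model-name-derived URL, which matches the generic endpoint handling discussed above.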

explicit (optional) dependency, because without it poetry would take forever to resolve `xinference` dependencies
mds (copied and adapted from biocypher repo)
Contributor

@slobentanzer left a comment


Thanks!
