Is there a tutorial on how to start the server with a downloaded model from hugging face?
So far, I have figured out how to build:
cargo build -r
Though I have downloaded WizardCoder-Python-34B-V1.0, I can't start the server. I've tried:
❯ ./target/release/llm-ls /vault/models/WizardCoder/
Content-Length: 75
{"jsonrpc":"2.0","error":{"code":-32700,"message":"Parse error"},"id":null}

❯ ./target/release/llm-ls /vault/models/WizardCoder/WizardCoder-Python-34B-V1.0.bin
Content-Length: 75
{"jsonrpc":"2.0","error":{"code":-32700,"message":"Parse error"},"id":null}
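Side note for anyone hitting the same error: llm-ls is a language server, so it expects LSP JSON-RPC messages framed with a Content-Length header on stdin rather than a model path as an argument, which is presumably why any other input comes back with the -32700 "Parse error" response shown above. A rough sketch of the kind of framed message an editor would normally send it over stdio, per the LSP spec; the exact fields here are only illustrative:

Content-Length: 92

{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"processId":null,"capabilities":{}}}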
@aemonge llm-ls does not run the model for you, it only sends requests to a backend API that runs a model. I'd suggest looking at https://github.com/huggingface/text-generation-inference, running it either on your computer or on a remote host. You can also use https://github.com/mlc-ai/mlc-llm IIRC. Finally, https://ollama.ai/ should be compatible with llm-ls pretty soon.
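For anyone else landing here, a minimal sketch of that setup: serve the model with text-generation-inference and point your editor's llm-ls integration at it. The Docker image and --model-id flag are taken from the TGI README; the model id, port, and volume path below are only examples, so adapt them to your machine and check the TGI docs for the current image tag:

❯ docker run --gpus all --shm-size 1g -p 8080:80 \
    -v /vault/models:/data \
    ghcr.io/huggingface/text-generation-inference:latest \
    --model-id WizardLM/WizardCoder-Python-34B-V1.0

The editor extension (llm.nvim or llm-vscode) then spawns llm-ls itself over stdio; you only configure the backend URL there, e.g. http://localhost:8080, with the exact setting name depending on the extension.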
I've never heard of ollama, but it seems to be what I was looking for <3
Is there an issue tracking the ollama integration, so that I can be notified when it lands?
Also, thank you very much for the support and guidance.
#40
No problem!