Confidential computing demo on how to use LLMs on GWS encrypted docs

Cosmian/cosmian-ai-runner

Cosmian AI runner

Confidential computing backend to run AI models

Structure

The AI Runner is a Flask-based application that provides endpoints for performing inference across various AI tasks and pipelines, such as text summarization, translation, text querying, and retrieval-augmented generation (RAG) over document databases.

These pipelines are built on the Haystack library (https://haystack.deepset.ai/). Users can customize them by changing the loaded models, choosing a vector database, and adjusting the setup to their own requirements.

Usage

  • Build and install the app

  • Edit the config file (more info)

  • Run the app

CONFIG_PATH="./run/config.json" cosmian-ai-runner --port 5001
The current model requires a Hugging Face token, which must be set in the HF_API_TOKEN environment variable.
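
The launch command relies on two environment variables, CONFIG_PATH and HF_API_TOKEN. As a minimal sketch, a pre-launch check along the following lines could catch a missing token or an unreadable config file before the app starts. The `preflight` helper is hypothetical and not part of the repository, and it assumes only that the config file is valid JSON (suggested by its `.json` extension); it does not validate the config's contents.

```python
import json
import os
from pathlib import Path


def preflight(env=None):
    """Return a list of problems likely to stop the runner from starting.

    Hypothetical helper, not part of cosmian-ai-runner.
    """
    env = os.environ if env is None else env
    problems = []

    # CONFIG_PATH is the variable used in the launch command.
    config_path = env.get("CONFIG_PATH")
    if not config_path:
        problems.append("CONFIG_PATH is not set")
    elif not Path(config_path).is_file():
        problems.append(f"config file not found: {config_path}")
    else:
        # Assumption: the config file is plain JSON.
        try:
            json.loads(Path(config_path).read_text(encoding="utf-8"))
        except json.JSONDecodeError as exc:
            problems.append(f"config file is not valid JSON: {exc}")

    # HF_API_TOKEN is required with the current model.
    if not env.get("HF_API_TOKEN"):
        problems.append("HF_API_TOKEN is not set")

    return problems
```

Calling `preflight()` with no arguments reads the current process environment. An empty list means both variables look usable, and any returned strings can be printed before aborting the launch.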

The API endpoints are documented in the app/ folder of the repository.
