GenAI Studio streamlines the creation of enterprise Generative AI applications by providing alternate to manual processes with a seamless, end-to-end solution. From GenAI app development and evaluation to performance benchmarking and deployment, GenAI Studio empowers developers to effortlessly build, test, optimize their LLM solutions and create the deployment package. Its intuitive no-code/low-code interface accelerates innovation, enabling rapid development and deployment of cutting-edge AI applications with unparalleled efficiency and precision.
🌟Key Capabilities
- 🛠️ Build & configure GenAI applications rapidly with low-code and no setup hassle
- 🔍 Instant evaluation with sandbox on Kubernetes Namespace
- 📊 Dynamic performance measurement & benchmarking with Grafana dashboarding via Prometheus
- ⚙️ One-click deployment package generation for instant application setup
GenAI Studio revolutionizes application development by eliminating the need for manual scripting. It encapsulates OPEA's GenAIComps microservices into intuitive, configurable UI blocks, enabling users to effortlessly create app workflows through a drag-and-drop interface on a GUI canvas. Once the workflow is designed, users can test its functionality in a sandbox environment within individual Kubernetes namespaces. Performance can be monitored and benchmarked using Prometheus, with detailed insights available via Grafana dashboards for each workflow and microservice. Upon completion, users can generate a deployment package that allows one-click deployment, facilitating instant application setup on their own machines. The package includes scripts for direct setup, featuring Dockerfiles for single-machine deployment and manifests for Kubernetes environments.
- Key Components
- Setting up GenAIStudio
- Getting Started with GenAIStudio
- Known Limitations
- Additional Content
The Kubernetes cluster hosts both the Studio, Sandbox and Monitoring namespaces.
- Studio Namespace: This is the core application namespace containing the Studio-Frontend and Studio-Backend. The Studio-Frontend allows users to manage workflows and build GenAI pipelines. The Studio-Backend includes the Evaluation Sandbox Manager and Deployment Package Generator, which handle deployment of user-designed pipelines.
- Sandbox Namespace(s): Managed by the Studio server, these namespaces create sandbox environments where users can test and evaluate the pipelines constructed through Studio-Frontend.
- Monitoring Namespace: This namespace contains a Prometheus service that collects performance data from each sandbox. These collected metrics are visualized in a Grafana dashboard, enabling users to monitor resource utilization and pipeline performance.
The Studio UI consists two main pages.
- Main Page: Provides an overview of the workflow list, allowing users to easily manage workflows, launch sandboxes, download deployment packages, and manage package manifests.
- Canvas Page: A drag-and-drop interface built on Flowise, enabling users to build GenAI applications by assembling various configurable microservice blocks into a GenAI pipeline. These microservices form the core components necessary for creating Retrieval-Augmented Generation (RAG) applications. The pipeline can be run and tested within the Studio’s Sandbox environment.
A sandbox is automatically created when user constructed a workflow through Studio UI. Each sandbox runs independently, allowing for performance testing and monitoring. The sandbox namespace, managed by the Studio-Backend, contains necessary microservices based on the pipeline constructed and includes a App UI. Users can test the pipeline's inference performance and track resource utilization via an integrated Grafana dashboard.
GenAIStudio currently supports a subset of microservices from GenAIComps, including DataPrep with Redis, TEI Embedding, Retriever with Redis, Reranks, LLMs and Guardrails. This list is expected to grow in future releases, expanding the range of services available for building and testing GenAI pipelines.
Users can download the deployment package in a ZIP file from the Studio UI and deploy it locally using Docker Compose. This local setup mirrors the sandbox environment, providing the same application configuration and microservices. It enables users to seamlessly deploy the GenAI application constructed in GenAIStudio on their local machine with no hussle.
If you're part of a team and want to provide a testing playground for your members, you'll need to set up GenAIStudio. This setup allows your team members to test and evaluate their pipelines in a sandbox environment without worrying about infrastructure management.
However, if you already have access to an existing Studio instance, you can skip the setup process and move directly to the next section to begin working with your workflows.
Memory: >64GiB (To run a ChatQnA Sandbox, approx. 45GiB memory is needed)
Storage: >100GiB
OS: Linux Debian-based distribution
Note: This setup has been validated on a system running Ubuntu 22.04 on an Intel(R) Xeon(R) Platinum 8468 server with 1000GB memory and 300GB local storage.
GenAIStudio requires an on-premise Kubernetes cluster. If your server does not have Kubernetes set up, please install by following the Kubernetes official setup guide. Alternatively, you can try out our setup onpremise kubernetes script.
Note: GenAIStudio have only been validated in a single node bare-metal kubernetes setup using setup onpremise kubernetes script.
The installation is done using genai-studio playbook script. The script will
- Deploy a persistent volume for prometheus and a customized monitoring stack based on prometheus-community/kube-prometheus-stack (which contains both Prometheus and Grafana) in the monitoring namespace.
- Deploy the studio-backend, studio-frontend and also a studio-nginx in the studio namespace.
The installation can be done with the following steps:
-
Clone this repo to your server.
git clone https://github.com/opea-project/GenAIStudio cd GenAIStudio
-
Run the commands below
sudo apt install ansible -y cd setup-scripts/setup-genai-studio ansible-playbook genai-studio.yml
Note: you can review the deployment configurations in genai-studio.yml
-
Configure admin console
Refer to the Admin Console Configuration Guide to learn how to configure and customize the admin console, as well as manage user permissions.
You can access the the Studio UI in a web browser at https://<studio_server_ip>:30007
, which will take you to the login page.
You might see a net::ERR_CERT_AUTHORITY_INVALID
warning in your browser, as shown in the screenshot below. This warning occurs because a self-signed SSL/TLS certificate is being used. If you are accessing the Studio setup on an internal private network, you can safely ignore this warning by clicking the Proceed button.
If you are a first-time user, you will need to register as a new user, and request approval from your admin before gaining access to the Main Page
. Once your registration is approved, simply refresh the page to proceed.
Get started quickly by running a built-in sample workflow:
-
On the
Main Page
, click on Import Sample Workflow and you will be able to see a list of sample workflows. -
Click on
sample_workflow_chatqna
to open the workflow in the canvas. Review the nodes and their configurations. You can modify the models to your preference and enter the Hugging Face token if needed. -
Once everything is in place, you can return to the main page and click the run button to launch the sandbox.
-
Create a new workflow:
On the
Main Page
, click Create New Workflow. This will open a blankCanvas Page
where you can start building your new workflow. -
Drag-and-drop microservice nodes:
In the top-left corner of the canvas, click the ➕ Add Nodes button to view a dropdown list of available microservice nodes. You can expand each category and drag nodes onto the canvas. Use your mouse scroll to zoom in and out of the canvas for easier navigation.
-
Configure and connect microservice nodes:
Some nodes have configurable parameters that let you set specific settings. After configuring a node, you can connect it to another node by dragging the output edge and dropping it onto the input edge of the target node. Once your workflow is complete, click the 💾 Save Workflow button to save your project.
*The saved workflow can be imported by selecting
Import Workflow
button from the dropdown of the ⚙️ icon located at the top right of workflow canvas. -
Return to the main page and click the run button to launch the sandbox.
Back on the Main Page, you’ll see the saved workflow under the Workflow list. Click the
-
Conversational Interactions with Custom Settings
Engage in a conversation with the chatbot using various parameter settings. The chatbot’s responses will be tailored based on the content in the uploaded data sources.
-
Upload Data Sources.
Add data sources by uploading files (supported file types: .pdf, .docx, and .txt) or by inserting multiple URLs to online articles.
-
View real-time resource utilization metrics for the Sandbox environment. CPU and memory usage are tracked and visualized in the Grafana dashboard for easy monitoring.
To deploy the downloaded GenAI application locally, ensure that Docker Engine with Docker Compose are installed on your machine. If you haven't installed them yet, please refer to the official Docker documentation for setup instructions.
The downloaded zip file includes the necessary configurations for deploying the microservices. Follow these steps to set up and run the application with Docker Compose:
-
Set the
public_host_ip
environment variables in.env
or export it in your environtment:# Note: public_host_ip is an external public IP # ./.env public_host_ip=192.168.1.1
If you’re in a proxy environment, add or export the relevant proxy settings:
# ./.env public_host_ip=192.168.1.1 http_proxy="Your_HTTP_Proxy" https_proxy="Your_HTTPS_Proxy" no_proxy="Your_No_Proxy,app-nginx,app-frontend,app-backend,...Other_Compose_Services"
Note: To find your public host IP, you can run the following command in your terminal:
curl ifconfig.me
-
Start the application using Docker Compose:
docker compose -f compose.yaml up -d
-
Access the application by opening your web browser and go to:
http://<public_host_ip>:8080
- Having more than one same microservice node within the same workflow is not allowed. The sandbox execution of a workflow with more than one instance of the same microservice node will fail.
- Only deployment on CPU is supported in current GenAIStudio release. Support for GPU or other devices will be available in future release.