Simona: AI Computer Operator

Simona is an AI voice agent that can see the screen, press buttons, and type text, i.e. autonomously operate a computer on your behalf. It uses open-source models that run locally or are self-hosted, so it does not depend on external APIs and does not share personal data with third parties.

Design

Key components:

Voice Activity Detection (VAD)
Speech to Text (STT)
Language Model (LM)
Text to Speech (TTS)
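
One way these four components could be chained is sketched below. This is only an illustration of the VAD -> STT -> LM -> TTS flow, not the project's actual code: the `VoicePipeline` class, its field names, and the stub stages in `make_stub_pipeline` are all hypothetical, and real models would replace the stubs.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass
class VoicePipeline:
    """Hypothetical wiring of the four components; each stage is a plain callable."""
    is_speech: Callable[[bytes], bool]   # VAD: does this audio frame contain speech?
    transcribe: Callable[[bytes], str]   # STT: audio of one utterance -> text
    respond: Callable[[str], str]        # LM: user text -> reply text
    synthesize: Callable[[str], bytes]   # TTS: reply text -> audio

    def run(self, frames: Iterable[bytes]) -> Iterator[bytes]:
        """Group speech frames into utterances and yield one audio reply per utterance."""
        buffer: List[bytes] = []
        for frame in frames:
            if self.is_speech(frame):
                buffer.append(frame)
            elif buffer:
                # A non-speech frame after speech closes the current utterance.
                yield self._handle(b"".join(buffer))
                buffer = []
        if buffer:
            # Flush an utterance that was still open when the stream ended.
            yield self._handle(b"".join(buffer))

    def _handle(self, utterance: bytes) -> bytes:
        text = self.transcribe(utterance)
        reply = self.respond(text)
        return self.synthesize(reply)


def make_stub_pipeline() -> VoicePipeline:
    """Stand-in stages (assumptions for the demo): empty frames are silence, audio is UTF-8 text."""
    return VoicePipeline(
        is_speech=lambda frame: len(frame) > 0,
        transcribe=lambda audio: audio.decode("utf-8"),
        respond=lambda text: "echo: " + text,
        synthesize=lambda reply: reply.encode("utf-8"),
    )


if __name__ == "__main__":
    frames = [b"open ", b"browser", b"", b"", b"type hi"]
    for audio in make_stub_pipeline().run(frames):
        print(audio)
```

Keeping each stage behind a plain callable means any local or self-hosted model can be swapped in without touching the loop that segments audio into utterances.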

References and inspiration

I got inspired by the following projects:

I don't fork them because my vision is slightly different, and I want to learn how to build such a system from scratch.
