Basic speech-to-text #102

Open
bramiozo opened this issue Jun 24, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@bramiozo
Collaborator

Describe the feature
A basic speech-to-text function that ingests a .wav file and outputs a dictionary of the following form:

document: 
 id: XX 
 text_estimate: blaat whooop ...
 word:
   id: 0
   text_verbose:  bladiebla
   text_estimate: blaat
   start_dt: 00:00:13
   end_dt: 00:12:00
 word:
   id: 1
   text_verbose:  whoopwhoop
   text_estimate: whooop
   start_dt: 00:15:13
   end_dt: 00:20:21
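
As a rough illustration of the structure above, here is a minimal Python sketch of the output dictionary. Everything in it is an assumption rather than an agreed design: the names `Word`, `Document` and `format_timestamp` are hypothetical, the timestamps are read as HH:MM:SS, and the repeated `word:` entries are collected into a `words` list, because a dictionary cannot hold the same key twice. It only covers the output format; the actual transcription step is left out.

```python
from dataclasses import dataclass, field


def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as HH:MM:SS (assumed timestamp format)."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


@dataclass
class Word:
    """One recognised word (hypothetical container, field names from the issue)."""
    id: int
    text_verbose: str
    text_estimate: str
    start_s: float  # offset from the start of the recording, in seconds
    end_s: float

    def to_dict(self) -> dict:
        return {
            "id": self.id,
            "text_verbose": self.text_verbose,
            "text_estimate": self.text_estimate,
            "start_dt": format_timestamp(self.start_s),
            "end_dt": format_timestamp(self.end_s),
        }


@dataclass
class Document:
    """A transcribed recording (hypothetical container)."""
    id: str
    words: list = field(default_factory=list)

    def to_dict(self) -> dict:
        return {
            "id": self.id,
            # Assumption: the document-level estimate is the word estimates joined by spaces.
            "text_estimate": " ".join(w.text_estimate for w in self.words),
            "words": [w.to_dict() for w in self.words],
        }
```

Building `Document("XX", [Word(0, "bladiebla", "blaat", 13, 720)])` and calling `to_dict()` would give a dictionary with the same keys as the example above, with `start_dt` and `end_dt` rendered as strings.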

A use case for the feature
Fast, verbose speech-to-text with timestamps.

Would you like to be involved in development?
Yo :D

Additional context

There is a concrete use case for this regarding pediatrics and language development.

Projects
Status: Later
Development

No branches or pull requests

1 participant