Basic speech-to-text #102

bramiozo · 2024-06-24T13:08:18Z

Describe the feature
A basic speech-to-text that ingests a .wav and outputs a dictionary;

document: 
 id: XX 
 text_estimate: blaat whooop ...
 word:
   id: 0
   text_verbose:  bladiebla
   text_estimate: blaat
   start_dt: 00:00:13
   end_dt: 00:12:00
 word:
   id: 1
   text_verbose:  whoopwhoop
   text_estimate: whooop
   start_dt: 00:15:13
   end_dt: 00:20:21

A use case for the feature
Fast, verbose text-to-speech with timestamps.

Would you like to be involved in development?
Yo :D

Additional context

There is a concrete use case for this regarding pediatrics and language development.

bramiozo added the enhancement New feature or request label Jun 24, 2024

github-project-automation bot added this to Clinlp development roadmap Jun 24, 2024

github-project-automation bot moved this to Later in Clinlp development roadmap Jun 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Basic speech-to-text #102

Basic speech-to-text #102

bramiozo commented Jun 24, 2024

Basic speech-to-text #102

Basic speech-to-text #102

Comments

bramiozo commented Jun 24, 2024