You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add initial support for transcriptions. Apple now supports transcriptions in podcasts.
Some audiobook files have transcriptions, and currently, we can use tools based on Whisper to transcribe audio to text.
In fact, most software based on Whisper support transcribing audio to text and exporting it as an SRT or VTT file. VTT is a native format for the web, and SRT is a common format for subtitles.
I'm creating this issue to discuss the best way to implement transcription support on the web player. I'm trying to implement some features on the pull request #
Note: I think we need define a standard for multi-language transcriptions. For example use some prefix in the file name like en- for English and es- for Spanish.
UI Ideas on the Web Player:
What's the best way to display the transcription on the web player?
Good: because the audio player is omnipresent on all pages and the transcription can be displayed in a fixed position.
Bad: Not enough space to display multiple lines of text.
Modal.
Good: More space to display multiple lines of text. Can float over the UI.
Bad: The modal can be intrusive.
Lateral Panel. (like the iTunes/Apple Music).
Good: More space to display multiple lines of text. Can have a search feature. Better for implement the seek feature (click on the line and seek to the corresponding time).
Bad: Take up space on the screen. Not good for small screens.
IMHO, when it comes to the UI you should combine both, the big panel for browsing and the panel below controls perhaps just with the current line but bigger. You have to take into account accessibility, some folks will want it to be resizable.
Describe the feature/enhancement
Transcription/Subtitle support
Summary
Add initial support for transcriptions. Apple now supports transcriptions in podcasts.
Some audiobook files have transcriptions, and currently, we can use tools based on Whisper to transcribe audio to text.
In fact, most software based on Whisper support transcribing audio to text and exporting it as an SRT or VTT file. VTT is a native format for the web, and SRT is a common format for subtitles.
I'm creating this issue to discuss the best way to implement transcription support on the web player. I'm trying to implement some features on the pull request #
Podcast transcription is supported by:
Possible tasks:
<podcast:transcript>
tag in RSS feed. Apple Docs / Podcast NamespaceUI Ideas on the Web Player:
What's the best way to display the transcription on the web player?
Related
The text was updated successfully, but these errors were encountered: