This is a command-line application that utilizes the free version of the Assembly AI API.
This app transcribes any audio files into texts. There are 4 types of transcribing methods:
Transcribe a video/audio file.
- You can also enable Speaker Diarization to detect speakers in a video/audio file.
Generating subtitles for video/audio files.
You can export your completed transcripts in SRT or VTT format, which can be used for subtitles and closed captions in videos.
- SRT (SubRip Text) files are commonly used to store subtitles for videos. The format is plain text, and it contains the timing information for each subtitle along with the subtitle text itself.
- WEBVTT (Web Video Text Tracks), which is a standard format for displaying timed text tracks (such as subtitles or captions) within HTML5 video.
Identifying highlights in video/audio files.
The Key Phrases model identifies significant words and phrases in your transcript and lets you extract the most important concepts or highlights from your audio or video file.
Creating summarized chapters from podcasts.
The Auto Chapters model summarizes audio data over time into chapters. Chapters make it easy for users to navigate and find specific information.
Each chapter contains the following:
- Summary
- One-line gist
- Headline
- Start and end timestamps
You can also try their playground website (upload your audio/video file).
- Python installed
- API Key from Assembly AI
python3 -m venv env
py -m venv env
source env/bin/activate
.\env\Scripts\activate
python3 -m pip install -r requirements.txt
py -m pip install -r requirements.txt
python3 -m pip freeze > requirements.txt
py -m pip freeze > requirements.txt