Text-to-Talk is a FastAPI application providing APIs for text summarization and text-to-speech conversion. It utilizes pre-trained models from Hugging Face's Transformers library using the Facebook's BART and the Suno Bark model for TTS.
- Summarize text using the BART model.
- Convert text to speech using the Bark model.
- Combine text summarization and text-to-speech into one endpoint.
Ensure you have the following installed:
- Node.js (for the client)
- Python 3.9+ (for the server)
- Git (for cloning the repository)
Follow these steps to set up and run the application:
Clone this repository to your local machine:
git clone https://github.com/iAmmar7/text-to-talk.git
cd text-to-talk
cd server
python -m venv venv
source venv/bin/activate # On Linux/Mac
venv\Scripts\activate # On Windows
pip install -r requirements.txt
uvicorn app.main:app --reload
The server will be available at http://localhost:8000
Once the application is running, you can access the API documentation at:
Swagger UI: http://127.0.0.1:8000/docs ReDoc: http://127.0.0.1:8000/redoc
cd client
npm install
npm run dev
The client will be available at http://localhost:5173.
- Start the backend server first.
- Start the frontend client.
- Open http://localhost:5173 in your browser to interact with the application.