GPT-4o-oIP is designed to empower developers to create telephony use cases for state-of-the-art multimodal AI models with Asterisk, a powerful open-source private branch exchange. This project aims to reduce setup time and cost by providing preconfigured containers that run locally without subscription. It currently supports Google DialogFlow ES, with future plans to incorporate OpenAI's GPT-4o and Kyutai's Moshi AI when they are released to developers.
Please see the project's wiki for installation, configuration, and usage instructions.
- Still waiting on the release of GPT-4o by OpenAI
- Still waiting on the release of Moshi AI by Kyutai
- Currently configured to work with DialogFlow ES by Google
- Form-based bot builder
- Natural language understanding (NLU) models
One-click telephony integrationbut you're here to run Asterisk locally, right?- Speech recognition and speech synthesis models
- 40+ template agents for building conversations for dining out, hotel booking, navigation, IoT, and more
- Integration into popular channels, such as Google Assistant, Slack, Twitter, and others
- Performance and custom dashboards
- Asterisk powering IP PBX systems and VoIP gateways
- PrivateDial Lite, customizable Asterisk configuration based on mlan's PrivateDial
- AutoBan, a built in intrusion detection and prevention system
- Additionally provide the G.729 and G.723.1 audio codecs
- Small image size based on Alpine Linux
- Automatic integration of Let’s Encrypt TLS certificates using the reverse proxy Traefik
- Persistent storage facilitated by configuration and run data being consolidated under
/srv
- Container audio using the pulse socket of the host
- Use runit, providing an init scheme and service supervision
- Health check
- Log directed to docker daemon with configurable level
- Multi-staged build providing the images
mini
,base
,full
andxtra