Multimodality #2141

rmuhawieh · 2025-01-14T17:54:37Z

Hello, I'm looking for ways to improve my privateGPT setup and would like it to work with images, and potentially audio, as context alongside the textual documents that already work. In other words, I'm looking for multimodality. I'm currently using Mistral and wondering what it would take to make privateGPT multimodal. I've heard of models like LLaVA and CLIP, but I'm not sure what the implementation would look like. Thanks

Replies: 0 comments
