Tru Era Applied

Created by team Team Tonic on December 07, 2023

TruLens BERT Zilliz Vectara Mistral AI StableCode LLavA OpenAI Fuyu-8B

Hackathon Submission: Enhanced Multimodal AI Performance Project Title: Optimizing Multimodal AI for Real-World Applications Overview: Our project focused on optimizing multimodal AI performance using the TruEra Machine Learning Ops platform. We evaluated 18 models across vision, audio, and text domains, employing innovative prompting strategies, performance metrics, and sequential configurations. Methodology: Prompting Strategies: Tailored prompts to maximize model response accuracy. Performance Metrics: Assessed models on accuracy, speed, and error rate. Sequential Configurations: Tested various model combinations for task-specific effectiveness. Key Models Evaluated: Vision: GPT4V, LLava-1.5, Qwen-VL, Clip (Google/Vertex), Fuyu-8B. Audio: Seamless 1.0 & 2.0, Qwen Audio, Whisper2 & Whisper3, Seamless on device, GoogleAUDIOMODEL. Text: StableMed, MistralMed, Qwen On Device, GPT, Mistral Endpoint, Intel Neural Chat, BERT (Google/Vertex). Results: Top Performers: Qwen-VL in vision, Seamless 2.0 in audio, and MistralMed in text. Insights: Balance between performance and cost is crucial. Some models like GPT and Intel Neural Chat underperformed or were cost-prohibitive. Future Directions: Focus on fine-tuning models like BERT using Vertex. Develop more connectors for TruLens for diverse endpoints. Submission Contents: GitHub Repository: [Link] Demo: [Link] Presentation: [Link] Our submission showcases the potential of multimodal AI evaluation using TruEra / TruLens in enhancing real-world application performance, marking a step forward in human-centered AI solutions.

Category tags:

Healthcare

Github Presentation Demo

Explore more applications

CodeSage-AI Code Review Tool

An AI-driven tool that reviews GitHub pull requests in real-time, providing clear and intelligent code feedback using Groq-accelerated LLaMA models and the BLACKBOX.AI Coding Agent.

innoventors-blackbox-track

Groq

FlowRishAI

Flowrish AI helps students think better, not less. It guides reflection instead of giving answers—strengthening minds, not replacing them. Offline-first on Snapdragon X Elite, with LLaMA 3 locally and Groq online. Because learning should grow you.

42AI Qualcomm Track

Groq

AMAGI - Vultr Track

Amagi is a proactive AI assistant that sees your screen, listens, remembers, and helps you stay focused—designed to run across devices with real-time context awareness

The Monad (AI-Smith Protocol) -Vultr Track

GroqLlama 4WhisperQdrant

AI Agent-Powered E-Commerce

An AI-powered shopping assistant built with FastAPI, Groq API (LLaMA models), and Neo4j knowledge graph for personalized e-commerce experiences

Hackcelerate - Prosus Track

GroqLlama 3

SnapChronicles - Qualcomm Track

A privacy-focused toolkit for real-time screen OCR and audio transcription on any PC, combining universal image text extraction, audio-to-text, and fast local semantic search—powered by Edge AI and Groq API.

Illuminative Lab - Qualcomm Track

GroqLlama 3.2

Joseph Pollack
CIO

Events @ lablab
For Innovators & Creators

Explore more applications

CodeSage-AI Code Review Tool

An AI-driven tool that reviews GitHub pull requests in real-time, providing clear and intelligent code feedback using Groq-accelerated LLaMA models and the BLACKBOX.AI Coding Agent.

innoventors-blackbox-track

Groq

FlowRishAI

42AI Qualcomm Track

Groq

AMAGI - Vultr Track

Amagi is a proactive AI assistant that sees your screen, listens, remembers, and helps you stay focused—designed to run across devices with real-time context awareness

The Monad (AI-Smith Protocol) -Vultr Track

GroqLlama 4WhisperQdrant

AI Agent-Powered E-Commerce

An AI-powered shopping assistant built with FastAPI, Groq API (LLaMA models), and Neo4j knowledge graph for personalized e-commerce experiences

Hackcelerate - Prosus Track

GroqLlama 3

SnapChronicles - Qualcomm Track

Illuminative Lab - Qualcomm Track

GroqLlama 3.2