Third Eye - AI Assistant App for Blind People | Google Gemini API Developer Competition Submission #buildwithgemini
Click the image above to watch a demo of the Third Eye App on YouTube.
Third Eye is an innovative Android application designed to assist blind people by leveraging the power of Gemini AI. The app provides advanced functionalities to enhance accessibility and support daily tasks through voice commands and Gemini AI-powered features.
- Purpose: Allows you to enter a custom query or instruction.
- How to Use:
- Open the app and navigate to the custom prompt section.
- Speak or type your custom instruction or query.
- The AI Assistant will process your input and provide a response based on the query.
- Purpose: Combine a custom prompt with an image for more context-specific responses.
- How to Use:
- Open the app and go to the custom prompt and image upload section.
- Speak or type your custom instruction and upload an image.
- The AI Assistant will analyze the image in conjunction with your prompt to provide a relevant response.
- Purpose: Provides you a textual description of an uploaded or captured image.
- How to Use:
- Open the app and navigate to the image description feature.
- Upload or capture an image using the app's camera.
- The AI Assistant will generate a descriptive text summarizing the content of the image.
- Purpose: Extracts and displays text from an image.
- How to Use:
- Open the app and go to the image-to-text conversion section.
- Upload or capture an image that contains text.
- The AI Assistant will process the image and provide the extracted text for you to view.
- Kotlin
- Jetpack Compose for UI development
- Navigation
- Camera
- Koin for Dependency Injection
- Generative AI for All AI-related stuff
- Gemini API - API client for interacting with the Google Gemini AI.
- Splash API - API for adding a splash screen to your app.
- Lifecycle and ViewModel - Provides lifecycle-aware ViewModel support for Jetpack Compose.
- Navigation - Jetpack Navigation library for handling in-app navigation in Jetpack Compose.
- Koin - Kotlin dependency injection library with multiplatform support.
- CameraX - A set of libraries for integrating and managing camera functionalities in Android apps.
- Markdown - Library for rendering Markdown content in Jetpack Compose.
- Clone the repository:
git clone https://github.com/Coding-Meet/Third-Eye.git
- Open in the latest version of Android Studio or intellij idea.
- Before running the project, obtain an API key from Google AI to communicate with the Gemini API.
- Add a
local.properties
file to the project root. - Place your Gemini API key in
local.properties
file asGemini_API_KEY
property.
Gemini_API_KEY="YOUR_API_KEY"
- Sync the project with Gradle files in Android Studio or IntelliJ IDEA to ensure all dependencies are downloaded.
- Build the project by selecting
Build
>Rebuild Project
from the menu. - Run the app on an emulator or physical device by selecting
Run
>Run 'app'
from the menu.
Your generosity is greatly appreciated! Thank you for supporting this project.
Meet