Skip to content

Latest commit

 

History

History
118 lines (93 loc) · 5.1 KB

README.md

File metadata and controls

118 lines (93 loc) · 5.1 KB

Third Eye - AI Assistant App for Blind People | Google Gemini API Developer Competition Submission #buildwithgemini

Third Eye App Preview

Click the image above to watch a demo of the Third Eye App on YouTube.

Third Eye is an innovative Android application designed to assist blind people by leveraging the power of Gemini AI. The app provides advanced functionalities to enhance accessibility and support daily tasks through voice commands and Gemini AI-powered features.

Features:

1. Custom Prompt:

  • Purpose: Allows you to enter a custom query or instruction.
  • How to Use:
    1. Open the app and navigate to the custom prompt section.
    2. Speak or type your custom instruction or query.
    3. The AI Assistant will process your input and provide a response based on the query.

2. Custom Prompt with Image:

  • Purpose: Combine a custom prompt with an image for more context-specific responses.
  • How to Use:
    1. Open the app and go to the custom prompt and image upload section.
    2. Speak or type your custom instruction and upload an image.
    3. The AI Assistant will analyze the image in conjunction with your prompt to provide a relevant response.

3. Describe Image:

  • Purpose: Provides you a textual description of an uploaded or captured image.
  • How to Use:
    1. Open the app and navigate to the image description feature.
    2. Upload or capture an image using the app's camera.
    3. The AI Assistant will generate a descriptive text summarizing the content of the image.

4. Image to Text:

  • Purpose: Extracts and displays text from an image.
  • How to Use:
    1. Open the app and go to the image-to-text conversion section.
    2. Upload or capture an image that contains text.
    3. The AI Assistant will process the image and provide the extracted text for you to view.

Technologies:

  • Kotlin
  • Jetpack Compose for UI development
  • Navigation
  • Camera
  • Koin for Dependency Injection
  • Generative AI for All AI-related stuff

Libraries:

  • Gemini API - API client for interacting with the Google Gemini AI.
  • Splash API - API for adding a splash screen to your app.
  • Lifecycle and ViewModel - Provides lifecycle-aware ViewModel support for Jetpack Compose.
  • Navigation - Jetpack Navigation library for handling in-app navigation in Jetpack Compose.
  • Koin - Kotlin dependency injection library with multiplatform support.
  • CameraX - A set of libraries for integrating and managing camera functionalities in Android apps.
  • Markdown - Library for rendering Markdown content in Jetpack Compose.

Installation:

  1. Clone the repository:
    git clone https://github.com/Coding-Meet/Third-Eye.git
  2. Open in the latest version of Android Studio or intellij idea.
  3. Before running the project, obtain an API key from Google AI to communicate with the Gemini API.
  4. Add a local.properties file to the project root.
  5. Place your Gemini API key in local.properties file as Gemini_API_KEY property.
Gemini_API_KEY="YOUR_API_KEY"
  1. Sync the project with Gradle files in Android Studio or IntelliJ IDEA to ensure all dependencies are downloaded.
  2. Build the project by selecting Build > Rebuild Project from the menu.
  3. Run the app on an emulator or physical device by selecting Run > Run 'app' from the menu.

Screenshot:

android android
android android
android android
android android
android android
android android

Your generosity is greatly appreciated! Thank you for supporting this project.

Connect with me

Author

Meet