Skip to content

Cloud AI live transcription and translation service for OBS

License

Notifications You must be signed in to change notification settings

locaal-ai/obs-cloudvocal

Repository files navigation

CloudVocal - Professional Cloud AI Transcription & Translation for OBS

GitHub GitHub Workflow Status Total downloads GitHub release (latest by date) GitHub stars Discord
Download:

Introduction

CloudVocal brings professional-grade cloud transcription and translation to your OBS streams and recordings. Powered by industry-leading cloud providers, it delivers exceptional accuracy and real-time performance for your live streaming needs. ✅ Professional-grade accuracy, ✅ support for 100+ languages, ✅ enterprise-level reliability, and ✅ blazing-fast performance!

If this plugin has been valuable consider adding a ⭐ to this GH repo, rating it on OBS, subscribing to my YouTube channel where I post updates, and supporting my work on GitHub, Patreon 🙏

CloudVocal integrates seamlessly with leading cloud providers to deliver enterprise-grade speech recognition and translation services. Simply configure your API credentials and start streaming with professional-quality captions and translations.

Features

Current Features:

  • Professional-grade transcription with industry-leading accuracy
  • Real-time translation using enterprise cloud translation services
  • Support for 100+ languages with dialect recognition
  • Streaming-optimized performance with minimal latency
  • Multiple cloud provider options for transcription and translation
  • Caption output in multiple formats (.txt, .srt, .vtt)
  • Sync'ed captions with OBS recording timestamps
  • Direct streaming to platforms (YouTube, Twitch) with embedded captions
  • Advanced text filtering and customization options
  • Partial transcriptions for a streaming-captions experience
  • Custom vocabulary and pronunciation support
  • Professional terminology handling for specific industries

Roadmap:

  • Speaker diarization for multi-speaker environments
  • Advanced profanity filtering options
  • Custom translation glossaries
  • Additional subtitle format support
  • Enhanced analytics and caption quality metrics

Usage

Tutorial videos and screenshots - coming soon!

Download and Installation

Check out the latest releases for downloads and install instructions.

Configuration

  1. Download and install the appropriate version for your operating system
  2. Configure your cloud provider credentials in the plugin settings
  3. Select your desired transcription and translation options
  4. Add CloudVocal as a filter to your audio source

Building

The plugin can be built on Windows, macOS, and Linux platforms. The build process is straightforward as all processing happens in the cloud.

Mac OSX

$ MACOS_ARCH="x86_64" ./.github/scripts/build-macos -c Release

Linux

$ sudo apt install -y libssl-dev
$ ./.github/scripts/build-linux

Windows

> .github/scripts/Build-Windows.ps1 -Configuration Release

Check out our other plugins:

  • LocalVocal for on-device transcription and translation
  • Background Removal for removing background from live portrait video
  • Detect for real-time on-device object detection
  • CleanStream for real-time profanity filter
  • URL/API Source for real-time data integrations
  • Squawk for real-time on-device speech generation (text-to-speech)