Cline Speech Extension

A VS Code extension that provides Text-to-Speech (TTS) and Speech-to-Text (STT) functionality for Cline AI, using the speaches-ai/speaches API server.

Features

Text to Speech: Convert selected text or clipboard text to speech
Speech to Text: Transcribe audio files to text
File Output: Save TTS output as audio files
Context Menu Integration: Right-click context menu options
Editor Integration: Direct text insertion for STT results

Commands

Cline: Text to Speech (cline-speech.tts)
- Converts selected text or clipboard text to speech
- Plays the audio directly in your default media player
Cline: Speech to Text (cline-speech.stt)
- Transcribes audio files to text
- Inserts the transcribed text at cursor position
Cline: Text to Speech with File (cline-speech.ttsWithFile)
- Converts text to speech and saves as audio file
- Prompts for filename
Cline: Speech to Text from File (cline-speech.sttFromFile)
- Transcribes audio files to text and inserts result
- Prompts for audio file selection
Cline: Voice to Text (Record) (cline-speech.voiceToText)
- Records voice from microphone and transcribes to text
- Note: Microphone access requires proper permissions and may have platform limitations
- Inserts the transcribed text at cursor position

Setup

Prerequisites

Install the speaches-ai/speaches server:

git clone https://github.com/speaches-ai/speaches.git
cd speaches
docker-compose up -d

Make sure the server is running on http://speaches.lan:8000 (default)

Important Note about Server Compatibility

The speaches-ai/speaches server is a Gradio web application and may not expose direct REST API endpoints that this extension expects. If you encounter "404 Not Found" errors, please verify that:

The server is properly running
You're using the correct version of the speaches server that supports the required API endpoints
The server is configured to expose the necessary TTS/STT endpoints

Installation

Install the extension in VS Code:
- Download the .vsix file or build from source
- In VS Code: Extensions → Install from VSIX → select the file
Restart VS Code

Configuration

The extension can be configured through VS Code settings:

Open VS Code Settings (Ctrl+,)
Search for "cline speech"
Set the Cline Speech: Api Endpoint to your speaches server address

Default endpoint: http://speaches.lan:8000

Task Completion Alerts

The extension supports optional task completion audio alerts:

Enable the "Cline Speech: Task Completion Alert" setting
When enabled, the extension will play an audio notification saying "Task Completed" after successful operations
This provides audible feedback when tasks are completed by Cline

Usage

Select text in your editor or copy text to clipboard
Use one of the commands from the Command Palette (Ctrl+Shift+P) or context menu
For STT commands, select an audio file when prompted

API Endpoints

The extension communicates with the speaches server using these endpoints:

POST /v1/audio/speech - Text to Speech conversion (with proper JSON payload)
POST /v1/audio/transcriptions - Speech to Text conversion

Text to Speech API Format

The speaches server expects a specific JSON format for TTS requests:

{
  "input": "Hello World!",
  "model": "tts-1",
  "voice": "alloy",
  "response_format": "wav",
  "speed": 1.0
}

Speech to Text API Format

For STT, the extension sends base64-encoded audio data to the /v1/audio/transcriptions endpoint.

Contributing

Contributions are welcome! Please fork the repository and submit pull requests.

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
node_modules		node_modules
out		out
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SERVER_SETUP.md		SERVER_SETUP.md
build.sh		build.sh
cline-speech-0.0.1.vsix		cline-speech-0.0.1.vsix
package-lock.json		package-lock.json
package-server.json		package-server.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cline Speech Extension

Features

Commands

Setup

Prerequisites

Important Note about Server Compatibility

Installation

Configuration

Task Completion Alerts

Usage

API Endpoints

Text to Speech API Format

Speech to Text API Format

Contributing

License

About

Uh oh!

Releases

Packages

Languages

License

Mikec78660/cline-speech-extension

Folders and files

Latest commit

History

Repository files navigation

Cline Speech Extension

Features

Commands

Setup

Prerequisites

Important Note about Server Compatibility

Installation

Configuration

Task Completion Alerts

Usage

API Endpoints

Text to Speech API Format

Speech to Text API Format

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages