@mastra/voice-azure 0.1.0-alpha.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,23 @@
1
+
2
+ > @mastra/voice-azure@0.1.0-alpha.1 build /home/runner/work/mastra/mastra/voice/azure
3
+ > tsup src/index.ts --format esm,cjs --experimental-dts --clean --treeshake=smallest --splitting
4
+
5
+ CLI Building entry: src/index.ts
6
+ CLI Using tsconfig: tsconfig.json
7
+ CLI tsup v8.4.0
8
+ TSC Build start
9
+ TSC ⚡️ Build success in 7826ms
10
+ DTS Build start
11
+ CLI Target: es2022
12
+ Analysis will use the bundled TypeScript version 5.8.2
13
+ Writing package typings: /home/runner/work/mastra/mastra/voice/azure/dist/_tsup-dts-rollup.d.ts
14
+ Analysis will use the bundled TypeScript version 5.8.2
15
+ Writing package typings: /home/runner/work/mastra/mastra/voice/azure/dist/_tsup-dts-rollup.d.cts
16
+ DTS ⚡️ Build success in 12425ms
17
+ CLI Cleaning output folder
18
+ ESM Build start
19
+ CJS Build start
20
+ CJS dist/index.cjs 12.79 KB
21
+ CJS ⚡️ Build success in 488ms
22
+ ESM dist/index.js 12.16 KB
23
+ ESM ⚡️ Build success in 488ms
package/CHANGELOG.md ADDED
@@ -0,0 +1,9 @@
1
+ # @mastra/voice-azure
2
+
3
+ ## 0.1.0-alpha.1
4
+
5
+ ### Patch Changes
6
+
7
+ - ec9fa6a: This package provides both Text-to-Speech (TTS) and Speech-to-Text (STT) capabilities through Azure API
8
+ - Updated dependencies [6794797]
9
+ - @mastra/core@0.6.4-alpha.1
package/LICENSE ADDED
@@ -0,0 +1,44 @@
1
+ Elastic License 2.0 (ELv2)
2
+
3
+ **Acceptance**
4
+ By using the software, you agree to all of the terms and conditions below.
5
+
6
+ **Copyright License**
7
+ The licensor grants you a non-exclusive, royalty-free, worldwide, non-sublicensable, non-transferable license to use, copy, distribute, make available, and prepare derivative works of the software, in each case subject to the limitations and conditions below
8
+
9
+ **Limitations**
10
+ You may not provide the software to third parties as a hosted or managed service, where the service provides users with access to any substantial set of the features or functionality of the software.
11
+
12
+ You may not move, change, disable, or circumvent the license key functionality in the software, and you may not remove or obscure any functionality in the software that is protected by the license key.
13
+
14
+ You may not alter, remove, or obscure any licensing, copyright, or other notices of the licensor in the software. Any use of the licensor’s trademarks is subject to applicable law.
15
+
16
+ **Patents**
17
+ The licensor grants you a license, under any patent claims the licensor can license, or becomes able to license, to make, have made, use, sell, offer for sale, import and have imported the software, in each case subject to the limitations and conditions in this license. This license does not cover any patent claims that you cause to be infringed by modifications or additions to the software. If you or your company make any written claim that the software infringes or contributes to infringement of any patent, your patent license for the software granted under these terms ends immediately. If your company makes such a claim, your patent license ends immediately for work on behalf of your company.
18
+
19
+ **Notices**
20
+ You must ensure that anyone who gets a copy of any part of the software from you also gets a copy of these terms.
21
+
22
+ If you modify the software, you must include in any modified copies of the software prominent notices stating that you have modified the software.
23
+
24
+ **No Other Rights**
25
+ These terms do not imply any licenses other than those expressly granted in these terms.
26
+
27
+ **Termination**
28
+ If you use the software in violation of these terms, such use is not licensed, and your licenses will automatically terminate. If the licensor provides you with a notice of your violation, and you cease all violation of this license no later than 30 days after you receive that notice, your licenses will be reinstated retroactively. However, if you violate these terms after such reinstatement, any additional violation of these terms will cause your licenses to terminate automatically and permanently.
29
+
30
+ **No Liability**
31
+ As far as the law allows, the software comes as is, without any warranty or condition, and the licensor will not be liable to you for any damages arising out of these terms or the use or nature of the software, under any kind of legal claim.
32
+
33
+ **Definitions**
34
+ The _licensor_ is the entity offering these terms, and the _software_ is the software the licensor makes available under these terms, including any portion of it.
35
+
36
+ _you_ refers to the individual or entity agreeing to these terms.
37
+
38
+ _your company_ is any legal entity, sole proprietorship, or other kind of organization that you work for, plus all organizations that have control over, are under the control of, or are under common control with that organization. _control_ means ownership of substantially all the assets of an entity, or the power to direct its management and policies by vote, contract, or otherwise. Control can be direct or indirect.
39
+
40
+ _your licenses_ are all the licenses granted to you for the software under these terms.
41
+
42
+ _use_ means anything you do with the software requiring one of your licenses.
43
+
44
+ _trademark_ means trademarks, service marks, and similar rights.
package/README.md ADDED
@@ -0,0 +1,78 @@
1
+ # @mastra/voice-azure
2
+
3
+ Azure Voice integration for Mastra, providing both Text-to-Speech (TTS) and Speech-to-Text (STT) capabilities using Azure's Cognitive Services Speech SDK.
4
+
5
+ ## Installation
6
+
7
+ ```bash
8
+ npm install @mastra/voice-azure
9
+ ```
10
+
11
+ ## Configuration
12
+
13
+ The module requires Azure Speech Services credentials that can be provided through environment variables or directly in the configuration:
14
+
15
+ ```bash
16
+ AZURE_API_KEY=your_speech_service_key
17
+ AZURE_REGION=your_azure_region
18
+ ```
19
+
20
+ ## Usage
21
+
22
+ ```typescript
23
+ import { AzureVoice } from '@mastra/voice-azure';
24
+
25
+ // Create voice with both speech and listening capabilities
26
+ const voice = new AzureVoice({
27
+ speechModel: {
28
+ apiKey: 'your-api-key', // Optional, can use AZURE_API_KEY env var
29
+ region: 'your-region', // Optional, can use AZURE_REGION env var
30
+ voiceName: 'en-US-AriaNeural', // Optional, default voice
31
+ },
32
+ listeningModel: {
33
+ apiKey: 'your-api-key', // Optional, can use AZURE_API_KEY env var
34
+ region: 'your-region', // Optional, can use AZURE_REGION env var
35
+ language: 'en-US', // Optional, recognition language
36
+ },
37
+ });
38
+
39
+ // List available voices
40
+ const voices = await voice.getSpeakers();
41
+
42
+ // Generate speech
43
+ const audioStream = await voice.speak('Hello from Mastra!', {
44
+ speaker: 'en-US-JennyNeural', // Optional: override default voice
45
+ });
46
+
47
+ // Convert speech to text
48
+ const text = await voice.listen(audioStream);
49
+ ```
50
+
51
+ ## Features
52
+
53
+ - High-quality neural Text-to-Speech synthesis
54
+ - Accurate Speech-to-Text recognition
55
+ - 200+ neural voices across multiple languages
56
+ - SSML support
57
+ - Real-time audio streaming
58
+ - Multiple audio format support
59
+
60
+ ## Voice Options
61
+
62
+ Azure provides numerous neural voices across different languages. Here are some popular English voices:
63
+
64
+ - en-US-JennyNeural (Female)
65
+ - en-US-GuyNeural (Male)
66
+ - en-US-AriaNeural (Female)
67
+ - en-US-DavisNeural (Male)
68
+ - en-GB-SoniaNeural (Female)
69
+ - en-GB-RyanNeural (Male)
70
+ - en-AU-NatashaNeural (Female)
71
+ - en-AU-WilliamNeural (Male)
72
+
73
+ Each voice ID follows the format: `{language}-{region}-{name}Neural`
74
+
75
+ For a complete list of supported voices, you can:
76
+
77
+ 1. Call the `getSpeakers()` method
78
+ 2. View the [Azure Neural TTS documentation](https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/language-support?tabs=tts)
@@ -0,0 +1,64 @@
1
+ import { MastraVoice } from '@mastra/core/voice';
2
+
3
+ export declare const AZURE_VOICES: readonly ["af-ZA-AdriNeural", "af-ZA-WillemNeural", "am-ET-MekdesNeural", "am-ET-AmehaNeural", "ar-AE-FatimaNeural", "ar-AE-HamdanNeural", "ar-BH-LailaNeural", "ar-BH-AliNeural", "ar-DZ-AminaNeural", "ar-DZ-IsmaelNeural", "ar-EG-SalmaNeural", "ar-EG-ShakirNeural", "ar-IQ-RanaNeural", "ar-IQ-BasselNeural", "ar-JO-SanaNeural", "ar-JO-TaimNeural", "ar-KW-NouraNeural", "ar-KW-FahedNeural", "ar-LB-LaylaNeural", "ar-LB-RamiNeural", "ar-LY-ImanNeural", "ar-LY-OmarNeural", "ar-MA-MounaNeural", "ar-MA-JamalNeural", "ar-OM-AyshaNeural", "ar-OM-AbdullahNeural", "ar-QA-AmalNeural", "ar-QA-MoazNeural", "ar-SA-ZariyahNeural", "ar-SA-HamedNeural", "ar-SY-AmanyNeural", "ar-SY-LaithNeural", "ar-TN-ReemNeural", "ar-TN-HediNeural", "ar-YE-MaryamNeural", "ar-YE-SalehNeural", "as-IN-YashicaNeural", "as-IN-PriyomNeural", "az-AZ-BanuNeural", "az-AZ-BabekNeural", "bg-BG-KalinaNeural", "bg-BG-BorislavNeural", "bn-BD-NabanitaNeural", "bn-BD-PradeepNeural", "bn-IN-TanishaaNeural", "bn-IN-BashkarNeural", "bs-BA-VesnaNeural", "bs-BA-GoranNeural", "ca-ES-JoanaNeural", "ca-ES-EnricNeural", "ca-ES-AlbaNeural", "cs-CZ-VlastaNeural", "cs-CZ-AntoninNeural", "cy-GB-NiaNeural", "cy-GB-AledNeural", "da-DK-ChristelNeural", "da-DK-JeppeNeural", "de-AT-IngridNeural", "de-AT-JonasNeural", "de-CH-LeniNeural", "de-CH-JanNeural", "de-DE-KatjaNeural", "de-DE-ConradNeural", "de-DE-SeraphinaMultilingualNeural", "de-DE-FlorianMultilingualNeural", "de-DE-AmalaNeural", "de-DE-BerndNeural", "de-DE-ChristophNeural", "de-DE-ElkeNeural", "de-DE-GiselaNeural", "de-DE-KasperNeural", "de-DE-KillianNeural", "de-DE-KlarissaNeural", "de-DE-KlausNeural", "de-DE-LouisaNeural", "de-DE-MajaNeural", "de-DE-RalfNeural", "de-DE-TanjaNeural", "de-DE-Seraphina:DragonHDLatestNeural", "el-GR-AthinaNeural", "el-GR-NestorasNeural", "en-AU-NatashaNeural", "en-AU-WilliamNeural", "en-AU-AnnetteNeural", "en-AU-CarlyNeural", "en-AU-DarrenNeural", "en-AU-DuncanNeural", "en-AU-ElsieNeural", "en-AU-FreyaNeural", "en-AU-JoanneNeural", "en-AU-KenNeural", "en-AU-KimNeural", "en-AU-NeilNeural", "en-AU-TimNeural", "en-AU-TinaNeural", "en-CA-ClaraNeural", "en-CA-LiamNeural", "en-GB-SoniaNeural", "en-GB-RyanNeural", "en-GB-LibbyNeural", "en-GB-AdaMultilingualNeural", "en-GB-OllieMultilingualNeural", "en-GB-AbbiNeural", "en-GB-AlfieNeural", "en-GB-BellaNeural", "en-GB-ElliotNeural", "en-GB-EthanNeural", "en-GB-HollieNeural", "en-GB-MaisieNeural", "en-GB-NoahNeural", "en-GB-OliverNeural", "en-GB-OliviaNeural", "en-GB-ThomasNeural", "en-GB-MiaNeural", "en-HK-YanNeural", "en-HK-SamNeural", "en-IE-EmilyNeural", "en-IE-ConnorNeural", "en-IN-AaravNeural", "en-IN-AashiNeural", "en-IN-AnanyaNeural", "en-IN-KavyaNeural", "en-IN-KunalNeural", "en-IN-NeerjaNeural", "en-IN-PrabhatNeural", "en-IN-RehaanNeural", "en-IN-AartiNeural", "en-IN-ArjunNeural", "en-KE-AsiliaNeural", "en-KE-ChilembaNeural", "en-NG-EzinneNeural", "en-NG-AbeoNeural", "en-NZ-MollyNeural", "en-NZ-MitchellNeural", "en-PH-RosaNeural", "en-PH-JamesNeural", "en-SG-LunaNeural", "en-SG-WayneNeural", "en-TZ-ImaniNeural", "en-TZ-ElimuNeural", "en-US-AvaMultilingualNeural", "en-US-AndrewMultilingualNeural", "en-US-EmmaMultilingualNeural", "en-US-BrianMultilingualNeural", "en-US-AvaNeural", "en-US-AndrewNeural", "en-US-EmmaNeural", "en-US-BrianNeural", "en-US-JennyNeural", "en-US-GuyNeural", "en-US-AriaNeural", "en-US-DavisNeural", "en-US-JaneNeural", "en-US-JasonNeural", "en-US-KaiNeural", "en-US-LunaNeural", "en-US-SaraNeural", "en-US-TonyNeural", "en-US-NancyNeural", "en-US-CoraMultilingualNeural", "en-US-ChristopherMultilingualNeural", "en-US-BrandonMultilingualNeural", "en-US-AmberNeural", "en-US-AnaNeural", "en-US-AshleyNeural", "en-US-BrandonNeural", "en-US-ChristopherNeural", "en-US-CoraNeural", "en-US-ElizabethNeural", "en-US-EricNeural", "en-US-JacobNeural", "en-US-JennyMultilingualNeural", "en-US-MichelleNeural", "en-US-MonicaNeural", "en-US-RogerNeural", "en-US-RyanMultilingualNeural", "en-US-SteffanNeural", "en-US-AdamMultilingualNeural", "en-US-AIGenerate1Neural", "en-US-AIGenerate2Neural", "en-US-AlloyTurboMultilingualNeural", "en-US-AmandaMultilingualNeural", "en-US-BlueNeural", "en-US-DavisMultilingualNeural", "en-US-DerekMultilingualNeural", "en-US-DustinMultilingualNeural", "en-US-EchoTurboMultilingualNeural", "en-US-EvelynMultilingualNeural", "en-US-FableTurboMultilingualNeural", "en-US-LewisMultilingualNeural", "en-US-LolaMultilingualNeural", "en-US-NancyMultilingualNeural", "en-US-NovaTurboMultilingualNeural", "en-US-OnyxTurboMultilingualNeural", "en-US-PhoebeMultilingualNeural", "en-US-SamuelMultilingualNeural", "en-US-SerenaMultilingualNeural", "en-US-ShimmerTurboMultilingualNeural", "en-US-SteffanMultilingualNeural", "en-US-Andrew:DragonHDLatestNeural", "en-US-Andrew2:DragonHDLatestNeural", "en-US-Aria:DragonHDLatestNeural", "en-US-Ava:DragonHDLatestNeural", "en-US-Brian:DragonHDLatestNeural", "en-US-Davis:DragonHDLatestNeural", "en-US-Emma:DragonHDLatestNeural", "en-US-Emma2:DragonHDLatestNeural", "en-US-Jenny:DragonHDLatestNeural", "en-US-Steffan:DragonHDLatestNeural", "en-ZA-LeahNeural", "en-ZA-LukeNeural"];
4
+
5
+ export declare class AzureVoice extends MastraVoice {
6
+ private speechConfig?;
7
+ private listeningConfig?;
8
+ private speechSynthesizer?;
9
+ private speechRecognizer?;
10
+ /**
11
+ * Creates a new instance of AzureVoice for text-to-speech and speech-to-text services.
12
+ *
13
+ * @param {Object} config - Configuration options
14
+ * @param {AzureVoiceConfig} [config.speechModel] - Configuration for text-to-speech
15
+ * @param {AzureVoiceConfig} [config.listeningModel] - Configuration for speech-to-text
16
+ * @param {VoiceId} [config.speaker] - Default voice ID for speech synthesis
17
+ */
18
+ constructor({ speechModel, listeningModel, speaker, }?: {
19
+ speechModel?: AzureVoiceConfig;
20
+ listeningModel?: AzureVoiceConfig;
21
+ speaker?: VoiceId;
22
+ });
23
+ /**
24
+ * Gets a list of available voices for speech synthesis.
25
+ *
26
+ * @returns {Promise<Array<{ voiceId: string; language: string; region: string; }>>} List of available voices
27
+ */
28
+ getSpeakers(): Promise<{
29
+ voiceId: "af-ZA-AdriNeural" | "af-ZA-WillemNeural" | "am-ET-MekdesNeural" | "am-ET-AmehaNeural" | "ar-AE-FatimaNeural" | "ar-AE-HamdanNeural" | "ar-BH-LailaNeural" | "ar-BH-AliNeural" | "ar-DZ-AminaNeural" | "ar-DZ-IsmaelNeural" | "ar-EG-SalmaNeural" | "ar-EG-ShakirNeural" | "ar-IQ-RanaNeural" | "ar-IQ-BasselNeural" | "ar-JO-SanaNeural" | "ar-JO-TaimNeural" | "ar-KW-NouraNeural" | "ar-KW-FahedNeural" | "ar-LB-LaylaNeural" | "ar-LB-RamiNeural" | "ar-LY-ImanNeural" | "ar-LY-OmarNeural" | "ar-MA-MounaNeural" | "ar-MA-JamalNeural" | "ar-OM-AyshaNeural" | "ar-OM-AbdullahNeural" | "ar-QA-AmalNeural" | "ar-QA-MoazNeural" | "ar-SA-ZariyahNeural" | "ar-SA-HamedNeural" | "ar-SY-AmanyNeural" | "ar-SY-LaithNeural" | "ar-TN-ReemNeural" | "ar-TN-HediNeural" | "ar-YE-MaryamNeural" | "ar-YE-SalehNeural" | "as-IN-YashicaNeural" | "as-IN-PriyomNeural" | "az-AZ-BanuNeural" | "az-AZ-BabekNeural" | "bg-BG-KalinaNeural" | "bg-BG-BorislavNeural" | "bn-BD-NabanitaNeural" | "bn-BD-PradeepNeural" | "bn-IN-TanishaaNeural" | "bn-IN-BashkarNeural" | "bs-BA-VesnaNeural" | "bs-BA-GoranNeural" | "ca-ES-JoanaNeural" | "ca-ES-EnricNeural" | "ca-ES-AlbaNeural" | "cs-CZ-VlastaNeural" | "cs-CZ-AntoninNeural" | "cy-GB-NiaNeural" | "cy-GB-AledNeural" | "da-DK-ChristelNeural" | "da-DK-JeppeNeural" | "de-AT-IngridNeural" | "de-AT-JonasNeural" | "de-CH-LeniNeural" | "de-CH-JanNeural" | "de-DE-KatjaNeural" | "de-DE-ConradNeural" | "de-DE-SeraphinaMultilingualNeural" | "de-DE-FlorianMultilingualNeural" | "de-DE-AmalaNeural" | "de-DE-BerndNeural" | "de-DE-ChristophNeural" | "de-DE-ElkeNeural" | "de-DE-GiselaNeural" | "de-DE-KasperNeural" | "de-DE-KillianNeural" | "de-DE-KlarissaNeural" | "de-DE-KlausNeural" | "de-DE-LouisaNeural" | "de-DE-MajaNeural" | "de-DE-RalfNeural" | "de-DE-TanjaNeural" | "de-DE-Seraphina:DragonHDLatestNeural" | "el-GR-AthinaNeural" | "el-GR-NestorasNeural" | "en-AU-NatashaNeural" | "en-AU-WilliamNeural" | "en-AU-AnnetteNeural" | "en-AU-CarlyNeural" | "en-AU-DarrenNeural" | "en-AU-DuncanNeural" | "en-AU-ElsieNeural" | "en-AU-FreyaNeural" | "en-AU-JoanneNeural" | "en-AU-KenNeural" | "en-AU-KimNeural" | "en-AU-NeilNeural" | "en-AU-TimNeural" | "en-AU-TinaNeural" | "en-CA-ClaraNeural" | "en-CA-LiamNeural" | "en-GB-SoniaNeural" | "en-GB-RyanNeural" | "en-GB-LibbyNeural" | "en-GB-AdaMultilingualNeural" | "en-GB-OllieMultilingualNeural" | "en-GB-AbbiNeural" | "en-GB-AlfieNeural" | "en-GB-BellaNeural" | "en-GB-ElliotNeural" | "en-GB-EthanNeural" | "en-GB-HollieNeural" | "en-GB-MaisieNeural" | "en-GB-NoahNeural" | "en-GB-OliverNeural" | "en-GB-OliviaNeural" | "en-GB-ThomasNeural" | "en-GB-MiaNeural" | "en-HK-YanNeural" | "en-HK-SamNeural" | "en-IE-EmilyNeural" | "en-IE-ConnorNeural" | "en-IN-AaravNeural" | "en-IN-AashiNeural" | "en-IN-AnanyaNeural" | "en-IN-KavyaNeural" | "en-IN-KunalNeural" | "en-IN-NeerjaNeural" | "en-IN-PrabhatNeural" | "en-IN-RehaanNeural" | "en-IN-AartiNeural" | "en-IN-ArjunNeural" | "en-KE-AsiliaNeural" | "en-KE-ChilembaNeural" | "en-NG-EzinneNeural" | "en-NG-AbeoNeural" | "en-NZ-MollyNeural" | "en-NZ-MitchellNeural" | "en-PH-RosaNeural" | "en-PH-JamesNeural" | "en-SG-LunaNeural" | "en-SG-WayneNeural" | "en-TZ-ImaniNeural" | "en-TZ-ElimuNeural" | "en-US-AvaMultilingualNeural" | "en-US-AndrewMultilingualNeural" | "en-US-EmmaMultilingualNeural" | "en-US-BrianMultilingualNeural" | "en-US-AvaNeural" | "en-US-AndrewNeural" | "en-US-EmmaNeural" | "en-US-BrianNeural" | "en-US-JennyNeural" | "en-US-GuyNeural" | "en-US-AriaNeural" | "en-US-DavisNeural" | "en-US-JaneNeural" | "en-US-JasonNeural" | "en-US-KaiNeural" | "en-US-LunaNeural" | "en-US-SaraNeural" | "en-US-TonyNeural" | "en-US-NancyNeural" | "en-US-CoraMultilingualNeural" | "en-US-ChristopherMultilingualNeural" | "en-US-BrandonMultilingualNeural" | "en-US-AmberNeural" | "en-US-AnaNeural" | "en-US-AshleyNeural" | "en-US-BrandonNeural" | "en-US-ChristopherNeural" | "en-US-CoraNeural" | "en-US-ElizabethNeural" | "en-US-EricNeural" | "en-US-JacobNeural" | "en-US-JennyMultilingualNeural" | "en-US-MichelleNeural" | "en-US-MonicaNeural" | "en-US-RogerNeural" | "en-US-RyanMultilingualNeural" | "en-US-SteffanNeural" | "en-US-AdamMultilingualNeural" | "en-US-AIGenerate1Neural" | "en-US-AIGenerate2Neural" | "en-US-AlloyTurboMultilingualNeural" | "en-US-AmandaMultilingualNeural" | "en-US-BlueNeural" | "en-US-DavisMultilingualNeural" | "en-US-DerekMultilingualNeural" | "en-US-DustinMultilingualNeural" | "en-US-EchoTurboMultilingualNeural" | "en-US-EvelynMultilingualNeural" | "en-US-FableTurboMultilingualNeural" | "en-US-LewisMultilingualNeural" | "en-US-LolaMultilingualNeural" | "en-US-NancyMultilingualNeural" | "en-US-NovaTurboMultilingualNeural" | "en-US-OnyxTurboMultilingualNeural" | "en-US-PhoebeMultilingualNeural" | "en-US-SamuelMultilingualNeural" | "en-US-SerenaMultilingualNeural" | "en-US-ShimmerTurboMultilingualNeural" | "en-US-SteffanMultilingualNeural" | "en-US-Andrew:DragonHDLatestNeural" | "en-US-Andrew2:DragonHDLatestNeural" | "en-US-Aria:DragonHDLatestNeural" | "en-US-Ava:DragonHDLatestNeural" | "en-US-Brian:DragonHDLatestNeural" | "en-US-Davis:DragonHDLatestNeural" | "en-US-Emma:DragonHDLatestNeural" | "en-US-Emma2:DragonHDLatestNeural" | "en-US-Jenny:DragonHDLatestNeural" | "en-US-Steffan:DragonHDLatestNeural" | "en-ZA-LeahNeural" | "en-ZA-LukeNeural";
30
+ language: string | undefined;
31
+ region: string | undefined;
32
+ }[]>;
33
+ /**
34
+ * Converts text to speech using Azure's Text-to-Speech service.
35
+ *
36
+ * @param {string | NodeJS.ReadableStream} input - Text to convert to speech
37
+ * @param {Object} [options] - Optional parameters
38
+ * @param {string} [options.speaker] - Voice ID to use for synthesis
39
+ * @returns {Promise<NodeJS.ReadableStream>} Stream containing the synthesized audio
40
+ * @throws {Error} If speech model is not configured or synthesis fails
41
+ */
42
+ speak(input: string | NodeJS.ReadableStream, options?: {
43
+ speaker?: string;
44
+ [key: string]: any;
45
+ }): Promise<NodeJS.ReadableStream>;
46
+ /**
47
+ * Transcribes audio (STT) from a Node.js stream using Azure.
48
+ *
49
+ * @param {NodeJS.ReadableStream} audioStream - The audio to be transcribed, must be in .wav format.
50
+ * @returns {Promise<string>} - The recognized text.
51
+ */
52
+ listen(audioStream: NodeJS.ReadableStream): Promise<string>;
53
+ }
54
+
55
+ declare interface AzureVoiceConfig {
56
+ apiKey?: string;
57
+ region?: string;
58
+ voiceName?: string;
59
+ language?: string;
60
+ }
61
+
62
+ export declare type VoiceId = (typeof AZURE_VOICES)[number];
63
+
64
+ export { }
@@ -0,0 +1,64 @@
1
+ import { MastraVoice } from '@mastra/core/voice';
2
+
3
+ export declare const AZURE_VOICES: readonly ["af-ZA-AdriNeural", "af-ZA-WillemNeural", "am-ET-MekdesNeural", "am-ET-AmehaNeural", "ar-AE-FatimaNeural", "ar-AE-HamdanNeural", "ar-BH-LailaNeural", "ar-BH-AliNeural", "ar-DZ-AminaNeural", "ar-DZ-IsmaelNeural", "ar-EG-SalmaNeural", "ar-EG-ShakirNeural", "ar-IQ-RanaNeural", "ar-IQ-BasselNeural", "ar-JO-SanaNeural", "ar-JO-TaimNeural", "ar-KW-NouraNeural", "ar-KW-FahedNeural", "ar-LB-LaylaNeural", "ar-LB-RamiNeural", "ar-LY-ImanNeural", "ar-LY-OmarNeural", "ar-MA-MounaNeural", "ar-MA-JamalNeural", "ar-OM-AyshaNeural", "ar-OM-AbdullahNeural", "ar-QA-AmalNeural", "ar-QA-MoazNeural", "ar-SA-ZariyahNeural", "ar-SA-HamedNeural", "ar-SY-AmanyNeural", "ar-SY-LaithNeural", "ar-TN-ReemNeural", "ar-TN-HediNeural", "ar-YE-MaryamNeural", "ar-YE-SalehNeural", "as-IN-YashicaNeural", "as-IN-PriyomNeural", "az-AZ-BanuNeural", "az-AZ-BabekNeural", "bg-BG-KalinaNeural", "bg-BG-BorislavNeural", "bn-BD-NabanitaNeural", "bn-BD-PradeepNeural", "bn-IN-TanishaaNeural", "bn-IN-BashkarNeural", "bs-BA-VesnaNeural", "bs-BA-GoranNeural", "ca-ES-JoanaNeural", "ca-ES-EnricNeural", "ca-ES-AlbaNeural", "cs-CZ-VlastaNeural", "cs-CZ-AntoninNeural", "cy-GB-NiaNeural", "cy-GB-AledNeural", "da-DK-ChristelNeural", "da-DK-JeppeNeural", "de-AT-IngridNeural", "de-AT-JonasNeural", "de-CH-LeniNeural", "de-CH-JanNeural", "de-DE-KatjaNeural", "de-DE-ConradNeural", "de-DE-SeraphinaMultilingualNeural", "de-DE-FlorianMultilingualNeural", "de-DE-AmalaNeural", "de-DE-BerndNeural", "de-DE-ChristophNeural", "de-DE-ElkeNeural", "de-DE-GiselaNeural", "de-DE-KasperNeural", "de-DE-KillianNeural", "de-DE-KlarissaNeural", "de-DE-KlausNeural", "de-DE-LouisaNeural", "de-DE-MajaNeural", "de-DE-RalfNeural", "de-DE-TanjaNeural", "de-DE-Seraphina:DragonHDLatestNeural", "el-GR-AthinaNeural", "el-GR-NestorasNeural", "en-AU-NatashaNeural", "en-AU-WilliamNeural", "en-AU-AnnetteNeural", "en-AU-CarlyNeural", "en-AU-DarrenNeural", "en-AU-DuncanNeural", "en-AU-ElsieNeural", "en-AU-FreyaNeural", "en-AU-JoanneNeural", "en-AU-KenNeural", "en-AU-KimNeural", "en-AU-NeilNeural", "en-AU-TimNeural", "en-AU-TinaNeural", "en-CA-ClaraNeural", "en-CA-LiamNeural", "en-GB-SoniaNeural", "en-GB-RyanNeural", "en-GB-LibbyNeural", "en-GB-AdaMultilingualNeural", "en-GB-OllieMultilingualNeural", "en-GB-AbbiNeural", "en-GB-AlfieNeural", "en-GB-BellaNeural", "en-GB-ElliotNeural", "en-GB-EthanNeural", "en-GB-HollieNeural", "en-GB-MaisieNeural", "en-GB-NoahNeural", "en-GB-OliverNeural", "en-GB-OliviaNeural", "en-GB-ThomasNeural", "en-GB-MiaNeural", "en-HK-YanNeural", "en-HK-SamNeural", "en-IE-EmilyNeural", "en-IE-ConnorNeural", "en-IN-AaravNeural", "en-IN-AashiNeural", "en-IN-AnanyaNeural", "en-IN-KavyaNeural", "en-IN-KunalNeural", "en-IN-NeerjaNeural", "en-IN-PrabhatNeural", "en-IN-RehaanNeural", "en-IN-AartiNeural", "en-IN-ArjunNeural", "en-KE-AsiliaNeural", "en-KE-ChilembaNeural", "en-NG-EzinneNeural", "en-NG-AbeoNeural", "en-NZ-MollyNeural", "en-NZ-MitchellNeural", "en-PH-RosaNeural", "en-PH-JamesNeural", "en-SG-LunaNeural", "en-SG-WayneNeural", "en-TZ-ImaniNeural", "en-TZ-ElimuNeural", "en-US-AvaMultilingualNeural", "en-US-AndrewMultilingualNeural", "en-US-EmmaMultilingualNeural", "en-US-BrianMultilingualNeural", "en-US-AvaNeural", "en-US-AndrewNeural", "en-US-EmmaNeural", "en-US-BrianNeural", "en-US-JennyNeural", "en-US-GuyNeural", "en-US-AriaNeural", "en-US-DavisNeural", "en-US-JaneNeural", "en-US-JasonNeural", "en-US-KaiNeural", "en-US-LunaNeural", "en-US-SaraNeural", "en-US-TonyNeural", "en-US-NancyNeural", "en-US-CoraMultilingualNeural", "en-US-ChristopherMultilingualNeural", "en-US-BrandonMultilingualNeural", "en-US-AmberNeural", "en-US-AnaNeural", "en-US-AshleyNeural", "en-US-BrandonNeural", "en-US-ChristopherNeural", "en-US-CoraNeural", "en-US-ElizabethNeural", "en-US-EricNeural", "en-US-JacobNeural", "en-US-JennyMultilingualNeural", "en-US-MichelleNeural", "en-US-MonicaNeural", "en-US-RogerNeural", "en-US-RyanMultilingualNeural", "en-US-SteffanNeural", "en-US-AdamMultilingualNeural", "en-US-AIGenerate1Neural", "en-US-AIGenerate2Neural", "en-US-AlloyTurboMultilingualNeural", "en-US-AmandaMultilingualNeural", "en-US-BlueNeural", "en-US-DavisMultilingualNeural", "en-US-DerekMultilingualNeural", "en-US-DustinMultilingualNeural", "en-US-EchoTurboMultilingualNeural", "en-US-EvelynMultilingualNeural", "en-US-FableTurboMultilingualNeural", "en-US-LewisMultilingualNeural", "en-US-LolaMultilingualNeural", "en-US-NancyMultilingualNeural", "en-US-NovaTurboMultilingualNeural", "en-US-OnyxTurboMultilingualNeural", "en-US-PhoebeMultilingualNeural", "en-US-SamuelMultilingualNeural", "en-US-SerenaMultilingualNeural", "en-US-ShimmerTurboMultilingualNeural", "en-US-SteffanMultilingualNeural", "en-US-Andrew:DragonHDLatestNeural", "en-US-Andrew2:DragonHDLatestNeural", "en-US-Aria:DragonHDLatestNeural", "en-US-Ava:DragonHDLatestNeural", "en-US-Brian:DragonHDLatestNeural", "en-US-Davis:DragonHDLatestNeural", "en-US-Emma:DragonHDLatestNeural", "en-US-Emma2:DragonHDLatestNeural", "en-US-Jenny:DragonHDLatestNeural", "en-US-Steffan:DragonHDLatestNeural", "en-ZA-LeahNeural", "en-ZA-LukeNeural"];
4
+
5
+ export declare class AzureVoice extends MastraVoice {
6
+ private speechConfig?;
7
+ private listeningConfig?;
8
+ private speechSynthesizer?;
9
+ private speechRecognizer?;
10
+ /**
11
+ * Creates a new instance of AzureVoice for text-to-speech and speech-to-text services.
12
+ *
13
+ * @param {Object} config - Configuration options
14
+ * @param {AzureVoiceConfig} [config.speechModel] - Configuration for text-to-speech
15
+ * @param {AzureVoiceConfig} [config.listeningModel] - Configuration for speech-to-text
16
+ * @param {VoiceId} [config.speaker] - Default voice ID for speech synthesis
17
+ */
18
+ constructor({ speechModel, listeningModel, speaker, }?: {
19
+ speechModel?: AzureVoiceConfig;
20
+ listeningModel?: AzureVoiceConfig;
21
+ speaker?: VoiceId;
22
+ });
23
+ /**
24
+ * Gets a list of available voices for speech synthesis.
25
+ *
26
+ * @returns {Promise<Array<{ voiceId: string; language: string; region: string; }>>} List of available voices
27
+ */
28
+ getSpeakers(): Promise<{
29
+ voiceId: "af-ZA-AdriNeural" | "af-ZA-WillemNeural" | "am-ET-MekdesNeural" | "am-ET-AmehaNeural" | "ar-AE-FatimaNeural" | "ar-AE-HamdanNeural" | "ar-BH-LailaNeural" | "ar-BH-AliNeural" | "ar-DZ-AminaNeural" | "ar-DZ-IsmaelNeural" | "ar-EG-SalmaNeural" | "ar-EG-ShakirNeural" | "ar-IQ-RanaNeural" | "ar-IQ-BasselNeural" | "ar-JO-SanaNeural" | "ar-JO-TaimNeural" | "ar-KW-NouraNeural" | "ar-KW-FahedNeural" | "ar-LB-LaylaNeural" | "ar-LB-RamiNeural" | "ar-LY-ImanNeural" | "ar-LY-OmarNeural" | "ar-MA-MounaNeural" | "ar-MA-JamalNeural" | "ar-OM-AyshaNeural" | "ar-OM-AbdullahNeural" | "ar-QA-AmalNeural" | "ar-QA-MoazNeural" | "ar-SA-ZariyahNeural" | "ar-SA-HamedNeural" | "ar-SY-AmanyNeural" | "ar-SY-LaithNeural" | "ar-TN-ReemNeural" | "ar-TN-HediNeural" | "ar-YE-MaryamNeural" | "ar-YE-SalehNeural" | "as-IN-YashicaNeural" | "as-IN-PriyomNeural" | "az-AZ-BanuNeural" | "az-AZ-BabekNeural" | "bg-BG-KalinaNeural" | "bg-BG-BorislavNeural" | "bn-BD-NabanitaNeural" | "bn-BD-PradeepNeural" | "bn-IN-TanishaaNeural" | "bn-IN-BashkarNeural" | "bs-BA-VesnaNeural" | "bs-BA-GoranNeural" | "ca-ES-JoanaNeural" | "ca-ES-EnricNeural" | "ca-ES-AlbaNeural" | "cs-CZ-VlastaNeural" | "cs-CZ-AntoninNeural" | "cy-GB-NiaNeural" | "cy-GB-AledNeural" | "da-DK-ChristelNeural" | "da-DK-JeppeNeural" | "de-AT-IngridNeural" | "de-AT-JonasNeural" | "de-CH-LeniNeural" | "de-CH-JanNeural" | "de-DE-KatjaNeural" | "de-DE-ConradNeural" | "de-DE-SeraphinaMultilingualNeural" | "de-DE-FlorianMultilingualNeural" | "de-DE-AmalaNeural" | "de-DE-BerndNeural" | "de-DE-ChristophNeural" | "de-DE-ElkeNeural" | "de-DE-GiselaNeural" | "de-DE-KasperNeural" | "de-DE-KillianNeural" | "de-DE-KlarissaNeural" | "de-DE-KlausNeural" | "de-DE-LouisaNeural" | "de-DE-MajaNeural" | "de-DE-RalfNeural" | "de-DE-TanjaNeural" | "de-DE-Seraphina:DragonHDLatestNeural" | "el-GR-AthinaNeural" | "el-GR-NestorasNeural" | "en-AU-NatashaNeural" | "en-AU-WilliamNeural" | "en-AU-AnnetteNeural" | "en-AU-CarlyNeural" | "en-AU-DarrenNeural" | "en-AU-DuncanNeural" | "en-AU-ElsieNeural" | "en-AU-FreyaNeural" | "en-AU-JoanneNeural" | "en-AU-KenNeural" | "en-AU-KimNeural" | "en-AU-NeilNeural" | "en-AU-TimNeural" | "en-AU-TinaNeural" | "en-CA-ClaraNeural" | "en-CA-LiamNeural" | "en-GB-SoniaNeural" | "en-GB-RyanNeural" | "en-GB-LibbyNeural" | "en-GB-AdaMultilingualNeural" | "en-GB-OllieMultilingualNeural" | "en-GB-AbbiNeural" | "en-GB-AlfieNeural" | "en-GB-BellaNeural" | "en-GB-ElliotNeural" | "en-GB-EthanNeural" | "en-GB-HollieNeural" | "en-GB-MaisieNeural" | "en-GB-NoahNeural" | "en-GB-OliverNeural" | "en-GB-OliviaNeural" | "en-GB-ThomasNeural" | "en-GB-MiaNeural" | "en-HK-YanNeural" | "en-HK-SamNeural" | "en-IE-EmilyNeural" | "en-IE-ConnorNeural" | "en-IN-AaravNeural" | "en-IN-AashiNeural" | "en-IN-AnanyaNeural" | "en-IN-KavyaNeural" | "en-IN-KunalNeural" | "en-IN-NeerjaNeural" | "en-IN-PrabhatNeural" | "en-IN-RehaanNeural" | "en-IN-AartiNeural" | "en-IN-ArjunNeural" | "en-KE-AsiliaNeural" | "en-KE-ChilembaNeural" | "en-NG-EzinneNeural" | "en-NG-AbeoNeural" | "en-NZ-MollyNeural" | "en-NZ-MitchellNeural" | "en-PH-RosaNeural" | "en-PH-JamesNeural" | "en-SG-LunaNeural" | "en-SG-WayneNeural" | "en-TZ-ImaniNeural" | "en-TZ-ElimuNeural" | "en-US-AvaMultilingualNeural" | "en-US-AndrewMultilingualNeural" | "en-US-EmmaMultilingualNeural" | "en-US-BrianMultilingualNeural" | "en-US-AvaNeural" | "en-US-AndrewNeural" | "en-US-EmmaNeural" | "en-US-BrianNeural" | "en-US-JennyNeural" | "en-US-GuyNeural" | "en-US-AriaNeural" | "en-US-DavisNeural" | "en-US-JaneNeural" | "en-US-JasonNeural" | "en-US-KaiNeural" | "en-US-LunaNeural" | "en-US-SaraNeural" | "en-US-TonyNeural" | "en-US-NancyNeural" | "en-US-CoraMultilingualNeural" | "en-US-ChristopherMultilingualNeural" | "en-US-BrandonMultilingualNeural" | "en-US-AmberNeural" | "en-US-AnaNeural" | "en-US-AshleyNeural" | "en-US-BrandonNeural" | "en-US-ChristopherNeural" | "en-US-CoraNeural" | "en-US-ElizabethNeural" | "en-US-EricNeural" | "en-US-JacobNeural" | "en-US-JennyMultilingualNeural" | "en-US-MichelleNeural" | "en-US-MonicaNeural" | "en-US-RogerNeural" | "en-US-RyanMultilingualNeural" | "en-US-SteffanNeural" | "en-US-AdamMultilingualNeural" | "en-US-AIGenerate1Neural" | "en-US-AIGenerate2Neural" | "en-US-AlloyTurboMultilingualNeural" | "en-US-AmandaMultilingualNeural" | "en-US-BlueNeural" | "en-US-DavisMultilingualNeural" | "en-US-DerekMultilingualNeural" | "en-US-DustinMultilingualNeural" | "en-US-EchoTurboMultilingualNeural" | "en-US-EvelynMultilingualNeural" | "en-US-FableTurboMultilingualNeural" | "en-US-LewisMultilingualNeural" | "en-US-LolaMultilingualNeural" | "en-US-NancyMultilingualNeural" | "en-US-NovaTurboMultilingualNeural" | "en-US-OnyxTurboMultilingualNeural" | "en-US-PhoebeMultilingualNeural" | "en-US-SamuelMultilingualNeural" | "en-US-SerenaMultilingualNeural" | "en-US-ShimmerTurboMultilingualNeural" | "en-US-SteffanMultilingualNeural" | "en-US-Andrew:DragonHDLatestNeural" | "en-US-Andrew2:DragonHDLatestNeural" | "en-US-Aria:DragonHDLatestNeural" | "en-US-Ava:DragonHDLatestNeural" | "en-US-Brian:DragonHDLatestNeural" | "en-US-Davis:DragonHDLatestNeural" | "en-US-Emma:DragonHDLatestNeural" | "en-US-Emma2:DragonHDLatestNeural" | "en-US-Jenny:DragonHDLatestNeural" | "en-US-Steffan:DragonHDLatestNeural" | "en-ZA-LeahNeural" | "en-ZA-LukeNeural";
30
+ language: string | undefined;
31
+ region: string | undefined;
32
+ }[]>;
33
+ /**
34
+ * Converts text to speech using Azure's Text-to-Speech service.
35
+ *
36
+ * @param {string | NodeJS.ReadableStream} input - Text to convert to speech
37
+ * @param {Object} [options] - Optional parameters
38
+ * @param {string} [options.speaker] - Voice ID to use for synthesis
39
+ * @returns {Promise<NodeJS.ReadableStream>} Stream containing the synthesized audio
40
+ * @throws {Error} If speech model is not configured or synthesis fails
41
+ */
42
+ speak(input: string | NodeJS.ReadableStream, options?: {
43
+ speaker?: string;
44
+ [key: string]: any;
45
+ }): Promise<NodeJS.ReadableStream>;
46
+ /**
47
+ * Transcribes audio (STT) from a Node.js stream using Azure.
48
+ *
49
+ * @param {NodeJS.ReadableStream} audioStream - The audio to be transcribed, must be in .wav format.
50
+ * @returns {Promise<string>} - The recognized text.
51
+ */
52
+ listen(audioStream: NodeJS.ReadableStream): Promise<string>;
53
+ }
54
+
55
+ declare interface AzureVoiceConfig {
56
+ apiKey?: string;
57
+ region?: string;
58
+ voiceName?: string;
59
+ language?: string;
60
+ }
61
+
62
+ export declare type VoiceId = (typeof AZURE_VOICES)[number];
63
+
64
+ export { }