RubyGems - elevenlabs - Versions diffs - 0.0.6 → 0.0.7 - Mend

elevenlabs 0.0.6 → 0.0.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 22770e41ca0d3c88d2dc5f83e3e4d9de510610bf1a0adaf9bf675951a647ab30
-  data.tar.gz: 8f6ffc3ef844da02c3f45385a730ebbddfe3c711d11e1b983837153c8dbd859a
+  metadata.gz: 2daafae7b6dbf3724b93ce2022b2fe6ac3703bfbcac12326b75e1a37cd188a39
+  data.tar.gz: ba2227a765efc7538e4aadbe0fcb0917a55c1ba70540a2660b4c75b2545f85da
 SHA512:
-  metadata.gz: 1b094e808358b342f7fe8cb08cf993dbafe2bac989bcb1e4655d5de5dd2884ff83626cb098da9b7c06d60a697302d33e848419f80a26fb53f34198f7894390b7
-  data.tar.gz: 44ad5334ed45f2628a22be91a0ed307b74f7e8611f6ba71cca0b4ee79f7e510f1f655e45321e47060b413e476ed9ea9913e304a44a789ec388bb7411235d42b4
+  metadata.gz: 07d40969dd5fdf8926c2f09c21359df4b5060b1f212797de03ef16c4fcf0dc2b6495c476a9a0741247dccf95dd30fde8d6da7404370b6acc4b61a0ea0ce8f7cd
+  data.tar.gz: 927e01fdc01e4f985466b62e2725f676117f757d3f809d8b4da7ea420e54c0e0a57ab2279cf031dd466451ce8c7a96dd76219da4a7e3d02ada842682a20246f8
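
The checksums above can be checked locally once the `.gem` archive has been unpacked (a `.gem` file is a plain tar archive that contains `metadata.gz` and `data.tar.gz`). The following is a minimal sketch, not part of the gem: `sha256_matches?` is a hypothetical helper name, and the file path in the usage comment assumes the archive was extracted into the current directory.

```ruby
require "digest"

# Hypothetical helper (not part of the elevenlabs gem): returns true when the
# SHA256 digest of the file at `path` equals the expected hex string.
def sha256_matches?(path, expected_hex)
  Digest::SHA256.file(path).hexdigest == expected_hex.strip.downcase
end

# Usage, assuming `tar -xf elevenlabs-0.0.7.gem` was run in this directory:
#   sha256_matches?(
#     "data.tar.gz",
#     "ba2227a765efc7538e4aadbe0fcb0917a55c1ba70540a2660b4c75b2545f85da"
#   )
```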

data/README.md CHANGED

@@ -14,6 +14,7 @@ This gem provides an easy-to-use interface for:
 - **Converting text to speech** and retrieving the generated audio
 - **Designing a voice** based on a text description
 - **Streaming text-to-speech audio**
+- **Music Generation**
 All requests are handled via [Faraday](https://github.com/lostisland/faraday).
@@ -304,7 +305,7 @@ Designed voices cannot be used for TTS until they are created in your account.
 If the voice is not immediately available for TTS, wait a few seconds or check its status via client.get_voice(voice_id) until it’s "active".
-10. Create a multi-speaker dialogue
+11. Create a multi-speaker dialogue
 ```ruby
 inputs = [{text: "It smells like updog in here", voice_id: "TX3LPaxmHKxFdv7VOQHJ"}, {text: "What's updog?", voice_id: "RILOU7YmBhvwJGDGjNmP"}, {text: "Not much, you?", voice_id: "TX3LPaxmHKxFdv7VOQHJ"}]
@@ -312,6 +313,33 @@ audio_data = client.text_to_dialogue(inputs)
 File.open("what's updog.mp3", "wb") { |f| f.write(audio_data) }
 ```
+12. **Generate Music from prompt**
+```ruby
+audio = client.compose_music(prompt: "Lo-fi hip hop beat", music_length_ms: 30000)
+File.binwrite("lofi.mp3", audio)
+```
+12. **Stream Music Generated from prompt**
+```ruby
+File.open("epic_stream.mp3", "wb") do |f|
+  client.compose_music_stream(prompt: "Epic orchestral build", music_length_ms: 60000) do |chunk|
+    f.write(chunk)
+  end
+end
+```
+13. **Generate Music with Detailed Metadata (metadata + audio) from prompt**
+```ruby
+result = client.compose_music_detailed(prompt: "Jazz piano trio", music_length_ms: 20000)
+puts result # raw multipart data (needs parsing)
+```
+14. **Create a music composition plan from prompt**
+```ruby
+plan = client.create_music_plan(prompt: "Upbeat pop song with verse and chorus", music_length_ms: 60000)
+puts plan[:sections]
+```
 ---
 ## Error Handling
@@ -368,7 +396,7 @@ gem build elevenlabs.gemspec
 Install the gem locally:
 ```bash
-gem install ./elevenlabs-0.0.5.gem
+gem install ./elevenlabs-0.0.7.gem
 ```
 ---
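
The README example for `compose_music_detailed` notes that the result is raw multipart data that still needs parsing. The sketch below shows one possible way to split such a body into parts. It is an illustration under assumptions, not the gem's behaviour: `split_multipart` is a hypothetical helper, it assumes the body starts with the boundary line and uses CRLF line endings, and the sample string is made up for the demonstration rather than taken from a real API response.

```ruby
# Hypothetical helper (not part of the elevenlabs gem): split a raw multipart
# body into parts of the form { headers: {...}, body: "..." }.
# Assumes the body begins with its boundary line and uses CRLF line endings.
def split_multipart(raw)
  raw = raw.dup.force_encoding(Encoding::BINARY)
  delimiter = raw[/\A--[^\r\n]+/] # first line, e.g. "--b1"
  return [] unless delimiter

  raw.split(delimiter).filter_map do |segment|
    # Skip the empty preamble and the closing "--" marker.
    next if segment.empty? || segment.start_with?("--")

    head, body = segment.sub(/\A\r\n/, "").split("\r\n\r\n", 2)
    next unless body

    headers = head.split("\r\n").to_h do |line|
      name, value = line.split(":", 2)
      [name.strip.downcase, value.to_s.strip]
    end
    { headers: headers, body: body.delete_suffix("\r\n") }
  end
end

# Made-up sample body, for illustration only.
sample = "--b1\r\nContent-Type: application/json\r\n\r\n{\"ok\":true}\r\n" \
         "--b1\r\nContent-Type: audio/mpeg\r\n\r\nAUDIO\r\n--b1--\r\n"
parts = split_multipart(sample)
puts parts.map { |part| part[:headers]["content-type"] }.inspect
```

With a real response, the audio part could then be written out with `File.binwrite`, as in the basic `compose_music` example. A dedicated multipart parser is likely the safer choice for production use.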

data/lib/elevenlabs/client.rb CHANGED

@@ -405,6 +405,110 @@ module Elevenlabs
       voice_id.in?(active_voices)
     end
+    #####################################################
+    #                     Music API                     #
+    #####################################################
+    # 1. Compose music (basic)
+    # POST /v1/music
+    def compose_music(options = {})
+      endpoint = "/v1/music"
+      request_body = {
+        prompt: options[:prompt],
+        composition_plan: options[:composition_plan],
+        music_length_ms: options[:music_length_ms],
+        model_id: options[:model_id] || "music_v1"
+      }.compact
+      headers = default_headers.merge("Accept" => "audio/mpeg")
+      query = {}
+      query[:output_format] = options[:output_format] if options[:output_format]
+      response = @connection.post("#{endpoint}?#{URI.encode_www_form(query)}") do |req|
+        req.headers = headers
+        req.body = request_body.to_json
+      end
+      response.body # raw binary audio
+    rescue Faraday::ClientError => e
+      handle_error(e)
+    end
+    # 2. Stream music
+    # POST /v1/music/stream
+    def compose_music_stream(options = {}, &block)
+      endpoint = "/v1/music/stream"
+      request_body = {
+        prompt: options[:prompt],
+        composition_plan: options[:composition_plan],
+        music_length_ms: options[:music_length_ms],
+        model_id: options[:model_id] || "music_v1"
+      }.compact
+      headers = default_headers.merge("Accept" => "audio/mpeg")
+      query = {}
+      query[:output_format] = options[:output_format] if options[:output_format]
+      @connection.post("#{endpoint}?#{URI.encode_www_form(query)}") do |req|
+        req.options.on_data = Proc.new do |chunk, _|
+          block.call(chunk) if block
+        end
+        req.headers = headers
+        req.body = request_body.to_json
+      end
+      nil # audio streamed via block
+    rescue Faraday::ClientError => e
+      handle_error(e)
+    end
+    # 3. Compose detailed music (metadata + audio)
+    # POST /v1/music/detailed
+    def compose_music_detailed(options = {})
+      endpoint = "/v1/music/detailed"
+      request_body = {
+        prompt: options[:prompt],
+        composition_plan: options[:composition_plan],
+        music_length_ms: options[:music_length_ms],
+        model_id: options[:model_id] || "music_v1"
+      }.compact
+      headers = default_headers
+      query = {}
+      query[:output_format] = options[:output_format] if options[:output_format]
+      response = @connection.post("#{endpoint}?#{URI.encode_www_form(query)}") do |req|
+        req.headers = headers
+        req.body = request_body.to_json
+      end
+      response.body # multipart/mixed with JSON + binary audio
+    rescue Faraday::ClientError => e
+      handle_error(e)
+    end
+    # 4. Create a composition plan
+    # POST /v1/music/plan
+    def create_music_plan(options = {})
+      endpoint = "/v1/music/plan"
+      request_body = {
+        prompt: options[:prompt],
+        music_length_ms: options[:music_length_ms],
+        source_composition_plan: options[:source_composition_plan],
+        model_id: options[:model_id] || "music_v1"
+      }.compact
+      response = @connection.post(endpoint) do |req|
+        req.headers = default_headers
+        req.body = request_body.to_json
+      end
+      JSON.parse(response.body, symbolize_names: true)
+    rescue Faraday::ClientError => e
+      handle_error(e)
+    end
     private
     # Common headers needed by Elevenlabs
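
The three compose methods added in this hunk build the same request body and query string. Because `URI.encode_www_form({})` returns an empty string, the path they post to ends in a bare `?` whenever `output_format` is not given. The sketch below shows how that shared logic could be factored out. It is a possible refactoring, not code from the gem: `music_request_path`, `music_request_body`, and `DEFAULT_MUSIC_MODEL` are hypothetical names, and `"mp3_44100_128"` is only an example value.

```ruby
require "json"
require "uri"

# Hypothetical constant mirroring the default model id used in the diff.
DEFAULT_MUSIC_MODEL = "music_v1"

# Hypothetical helper: append the query string only when there is something
# to send, so no trailing "?" is left on the path.
def music_request_path(endpoint, options = {})
  query = {}
  query[:output_format] = options[:output_format] if options[:output_format]
  query.empty? ? endpoint : "#{endpoint}?#{URI.encode_www_form(query)}"
end

# Hypothetical helper: build the JSON body, dropping keys whose value is nil.
def music_request_body(options = {})
  {
    prompt: options[:prompt],
    composition_plan: options[:composition_plan],
    music_length_ms: options[:music_length_ms],
    model_id: options[:model_id] || DEFAULT_MUSIC_MODEL
  }.compact.to_json
end

puts music_request_path("/v1/music")
puts music_request_path("/v1/music/stream", output_format: "mp3_44100_128")
puts music_request_body(prompt: "Lo-fi hip hop beat", music_length_ms: 30000)
```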

data/lib/elevenlabs.rb CHANGED

@@ -5,7 +5,7 @@ require_relative "elevenlabs/client"
 require_relative "elevenlabs/errors"
 module Elevenlabs
-  VERSION = "0.0.6"
+  VERSION = "0.0.7"
   # Optional global configuration
   class << self

metadata CHANGED

@@ -1,14 +1,14 @@
 --- !ruby/object:Gem::Specification
 name: elevenlabs
 version: !ruby/object:Gem::Version
-  version: 0.0.6
+  version: 0.0.7
 platform: ruby
 authors:
 - hackliteracy
 autorequire:
 bindir: bin
 cert_chain: []
-date: 2025-08-23 00:00:00.000000000 Z
+date: 2025-08-25 00:00:00.000000000 Z
 dependencies:
 - !ruby/object:Gem::Dependency
   name: faraday
@@ -39,7 +39,8 @@ dependencies:
       - !ruby/object:Gem::Version
         version: '1.1'
 description: This gem provides a convenient Ruby interface to the ElevenLabs TTS,
-  Voice Cloning, Voice Design, Voice dialogues and Streaming endpoints.
+  Voice Cloning, Voice Design, Voice dialogues, TTS Streaming, Music Generation and
+  Streaming endpoints.
 email:
 - hackliteracy@gmail.com
 executables: []