@huggingface/inference 2.8.1 → 3.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (50)
  1. package/LICENSE +1 -1
  2. package/README.md +45 -17
  3. package/dist/index.cjs +388 -134
  4. package/dist/index.js +383 -134
  5. package/dist/src/config.d.ts +3 -0
  6. package/dist/src/config.d.ts.map +1 -0
  7. package/dist/src/index.d.ts +5 -0
  8. package/dist/src/index.d.ts.map +1 -1
  9. package/dist/src/lib/getDefaultTask.d.ts +0 -1
  10. package/dist/src/lib/getDefaultTask.d.ts.map +1 -1
  11. package/dist/src/lib/makeRequestOptions.d.ts.map +1 -1
  12. package/dist/src/providers/fal-ai.d.ts +6 -0
  13. package/dist/src/providers/fal-ai.d.ts.map +1 -0
  14. package/dist/src/providers/replicate.d.ts +6 -0
  15. package/dist/src/providers/replicate.d.ts.map +1 -0
  16. package/dist/src/providers/sambanova.d.ts +6 -0
  17. package/dist/src/providers/sambanova.d.ts.map +1 -0
  18. package/dist/src/providers/together.d.ts +12 -0
  19. package/dist/src/providers/together.d.ts.map +1 -0
  20. package/dist/src/providers/types.d.ts +4 -0
  21. package/dist/src/providers/types.d.ts.map +1 -0
  22. package/dist/src/tasks/audio/automaticSpeechRecognition.d.ts.map +1 -1
  23. package/dist/src/tasks/audio/textToSpeech.d.ts.map +1 -1
  24. package/dist/src/tasks/custom/request.d.ts +1 -1
  25. package/dist/src/tasks/custom/request.d.ts.map +1 -1
  26. package/dist/src/tasks/custom/streamingRequest.d.ts.map +1 -1
  27. package/dist/src/tasks/cv/textToImage.d.ts +8 -0
  28. package/dist/src/tasks/cv/textToImage.d.ts.map +1 -1
  29. package/dist/src/tasks/nlp/chatCompletion.d.ts.map +1 -1
  30. package/dist/src/tasks/nlp/textGeneration.d.ts.map +1 -1
  31. package/dist/src/types.d.ts +16 -2
  32. package/dist/src/types.d.ts.map +1 -1
  33. package/package.json +2 -2
  34. package/src/config.ts +2 -0
  35. package/src/index.ts +5 -0
  36. package/src/lib/getDefaultTask.ts +1 -1
  37. package/src/lib/makeRequestOptions.ts +201 -59
  38. package/src/providers/fal-ai.ts +23 -0
  39. package/src/providers/replicate.ts +16 -0
  40. package/src/providers/sambanova.ts +23 -0
  41. package/src/providers/together.ts +60 -0
  42. package/src/providers/types.ts +6 -0
  43. package/src/tasks/audio/automaticSpeechRecognition.ts +10 -1
  44. package/src/tasks/audio/textToSpeech.ts +17 -2
  45. package/src/tasks/custom/request.ts +12 -6
  46. package/src/tasks/custom/streamingRequest.ts +18 -3
  47. package/src/tasks/cv/textToImage.ts +44 -1
  48. package/src/tasks/nlp/chatCompletion.ts +2 -2
  49. package/src/tasks/nlp/textGeneration.ts +43 -9
  50. package/src/types.ts +20 -2
package/LICENSE CHANGED
@@ -1,6 +1,6 @@
1
1
  MIT License
2
2
 
3
- Copyright (c) 2022 Tim Mikeladze
3
+ Copyright (c) 2022 Tim Mikeladze and the Hugging Face team
4
4
 
5
5
  Permission is hereby granted, free of charge, to any person obtaining a copy
6
6
  of this software and associated documentation files (the "Software"), to deal
package/README.md CHANGED
@@ -1,7 +1,7 @@
1
1
  # 🤗 Hugging Face Inference Endpoints
2
2
 
3
- A Typescript powered wrapper for the Hugging Face Inference Endpoints API. Learn more about Inference Endpoints at [Hugging Face](https://huggingface.co/inference-endpoints).
4
- It works with both [Inference API (serverless)](https://huggingface.co/docs/api-inference/index) and [Inference Endpoints (dedicated)](https://huggingface.co/docs/inference-endpoints/index).
3
+ A TypeScript-powered wrapper for the Hugging Face Inference API (serverless), Inference Endpoints (dedicated), and third-party Inference Providers.
4
+ It works with [Inference API (serverless)](https://huggingface.co/docs/api-inference/index) and [Inference Endpoints (dedicated)](https://huggingface.co/docs/inference-endpoints/index), and even with supported third-party Inference Providers.
5
5
 
6
6
  Check out the [full documentation](https://huggingface.co/docs/huggingface.js/inference/README).
7
7
 
@@ -42,7 +42,40 @@ const hf = new HfInference('your access token')
42
42
 
43
43
  Your access token should be kept private. If you need to protect it in front-end applications, we suggest setting up a proxy server that stores the access token.
44
44
 
45
- #### Tree-shaking
45
+ ### Third-party inference providers
46
+
47
+ You can send inference requests to third-party providers with the inference client.
48
+
49
+ Currently, we support the following providers: [Fal.ai](https://fal.ai), [Replicate](https://replicate.com), [Together](https://together.xyz) and [Sambanova](https://sambanova.ai).
50
+
51
+ To send requests to a third-party provider, you have to pass the `provider` parameter to the inference function. Make sure your request is authenticated with an access token.
52
+ ```ts
53
+ const accessToken = "hf_..."; // Either a HF access token, or an API key from the third-party provider (Replicate in this example)
54
+
55
+ const client = new HfInference(accessToken);
56
+ await client.textToImage({
57
+ provider: "replicate",
58
+ model: "black-forest-labs/FLUX.1-dev",
59
+ inputs: "A black forest cake"
60
+ })
61
+ ```
62
+
63
+ When authenticated with a Hugging Face access token, the request is routed through https://huggingface.co.
64
+ When authenticated with a third-party provider key, the request is made directly against that provider's inference API.
65
+
66
+ Only a subset of models are supported when requesting third-party providers. You can check the list of supported models per pipeline tasks here:
67
+ - [Fal.ai supported models](./src/providers/fal-ai.ts)
68
+ - [Replicate supported models](./src/providers/replicate.ts)
69
+ - [Sambanova supported models](./src/providers/sambanova.ts)
70
+ - [Together supported models](./src/providers/together.ts)
71
+ - [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)
72
+
73
+ ❗**Important note:** To be compatible, the third-party API must adhere to the "standard" shape API we expect on HF model pages for each pipeline task type.
74
+ This is not an issue for LLMs, as everyone has converged on the OpenAI API anyway, but it can be trickier for other tasks like "text-to-image" or "automatic-speech-recognition", where no standard API exists. Let us know if any help is needed or if we can make things easier for you!
75
+
76
+ 👋**Want to add another provider?** Get in touch if you'd like to add support for another Inference provider, and/or request it on https://huggingface.co/spaces/huggingface/HuggingDiscussions/discussions/49
77
+
78
+ ### Tree-shaking
46
79
 
47
80
  You can import the functions you need directly from the module instead of using the `HfInference` class.
48
81
 
@@ -91,23 +124,21 @@ Using the `chatCompletion` method, you can generate text with models compatible
91
124
  ```typescript
92
125
  // Non-streaming API
93
126
  const out = await hf.chatCompletion({
94
- model: "mistralai/Mistral-7B-Instruct-v0.2",
95
- messages: [{ role: "user", content: "Complete the this sentence with words one plus one is equal " }],
96
- max_tokens: 500,
127
+ model: "meta-llama/Llama-3.1-8B-Instruct",
128
+ messages: [{ role: "user", content: "Hello, nice to meet you!" }],
129
+ max_tokens: 512,
97
130
  temperature: 0.1,
98
- seed: 0,
99
131
  });
100
132
 
101
133
  // Streaming API
102
134
  let out = "";
103
135
  for await (const chunk of hf.chatCompletionStream({
104
- model: "mistralai/Mistral-7B-Instruct-v0.2",
136
+ model: "meta-llama/Llama-3.1-8B-Instruct",
105
137
  messages: [
106
- { role: "user", content: "Complete the equation 1+1= ,just the answer" },
138
+ { role: "user", content: "Can you help me solve an equation?" },
107
139
  ],
108
- max_tokens: 500,
140
+ max_tokens: 512,
109
141
  temperature: 0.1,
110
- seed: 0,
111
142
  })) {
112
143
  if (chunk.choices && chunk.choices.length > 0) {
113
144
  out += chunk.choices[0].delta.content;
@@ -396,11 +427,8 @@ Creates an image from a text prompt.
396
427
 
397
428
  ```typescript
398
429
  await hf.textToImage({
399
- inputs: 'award winning high resolution photo of a giant tortoise/((ladybird)) hybrid, [trending on artstation]',
400
- model: 'stabilityai/stable-diffusion-2',
401
- parameters: {
402
- negative_prompt: 'blurry',
403
- }
430
+ model: 'black-forest-labs/FLUX.1-dev',
431
+ inputs: 'a picture of a green bird'
404
432
  })
405
433
  ```
406
434
 
@@ -583,7 +611,7 @@ const { generated_text } = await gpt2.textGeneration({inputs: 'The answer to the
583
611
 
584
612
  // Chat Completion Example
585
613
  const ep = hf.endpoint(
586
- "https://api-inference.huggingface.co/models/mistralai/Mistral-7B-Instruct-v0.2"
614
+ "https://api-inference.huggingface.co/models/meta-llama/Llama-3.1-8B-Instruct"
587
615
  );
588
616
  const stream = ep.chatCompletionStream({
589
617
  model: "tgi",