PyPI - xinference - Versions diffs - 1.2.1__py3-none-any.whl → 1.3.0__py3-none-any.whl - Mend - Supply Chain Defender

xinference 1.2.1py3-none-any.whl → 1.3.0py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.

This version of xinference might be problematic. Click here for more details.

Files changed (80) hide show

xinference/model/llm/llm_family.py CHANGED Viewed

@@ -134,7 +134,7 @@ class LLMFamilyV1(BaseModel):
     model_name: str
     model_lang: List[str]
     model_ability: List[
-        Literal["embed", "generate", "chat", "tools", "vision", "audio"]
+        Literal["embed", "generate", "chat", "tools", "vision", "audio", "reasoning"]
     ]
     model_description: Optional[str]
     # reason for not required str here: legacy registration
@@ -143,6 +143,8 @@ class LLMFamilyV1(BaseModel):
     chat_template: Optional[str]
     stop_token_ids: Optional[List[int]]
     stop: Optional[List[str]]
+    reasoning_start_tag: Optional[str]
+    reasoning_end_tag: Optional[str]
 class CustomLLMFamilyV1(LLMFamilyV1):
@@ -538,7 +540,10 @@ def _generate_model_file_names(
     )
     need_merge = False
-    if llm_spec.quantization_parts is None:
+    if (
+        llm_spec.quantization_parts is None
+        or quantization not in llm_spec.quantization_parts
+    ):
         file_names.append(final_file_name)
     elif quantization is not None and quantization in llm_spec.quantization_parts:
         parts = llm_spec.quantization_parts[quantization]