ccfingerprint 1.0.5 → 1.1.0
- package/README.md +15 -4
- package/bin/ccfp.js +1 -1
- package/package.json +5 -2
- package/src/init.js +17 -2
- package/src/puzzles/capability.json +10 -0
- package/src/puzzles/confusion.json +12 -0
- package/src/puzzles/self-declare.json +1 -1
- package/src/puzzles/verification.json +1 -1
- package/src/templates/augment.en.md +100 -0
- package/src/templates/augment.md +100 -0
- package/src/templates/claude.en.md +14 -2
- package/src/templates/claude.md +14 -2
- package/src/templates/cline.en.md +100 -0
- package/src/templates/cline.md +100 -0
- package/src/templates/codex.en.md +10 -2
- package/src/templates/codex.md +10 -2
- package/src/templates/copilot.en.md +10 -2
- package/src/templates/copilot.md +10 -2
- package/src/templates/cursor.en.md +10 -2
- package/src/templates/cursor.md +10 -2
- package/src/templates/kiro.en.md +10 -2
- package/src/templates/kiro.md +10 -2
- package/src/templates/trae.en.md +100 -0
- package/src/templates/trae.md +100 -0
- package/src/templates/windsurf.en.md +10 -2
- package/src/templates/windsurf.md +10 -2
package/README.md
CHANGED
|
@@ -2,7 +2,9 @@
|
|
|
2
2
|
|
|
3
3
|
🇨🇳 [中文](README.md) | 🇺🇸 [English](README_EN.md)
|
|
4
4
|
|
|
5
|
-
AI
|
|
5
|
+
**你是不是经常感觉 AI 模型"降智"了?怀疑实际使用的模型被偷偷替换,并非服务商所宣称的那个?**
|
|
6
|
+
|
|
7
|
+
CCFingerprint 是一个 AI 模型身份指纹识别工具,通过四阶段自问自答测试,让模型自证身份,帮你验证 AI 助手背后的模型是否货真价实。
|
|
6
8
|
|
|
7
9
|
---
|
|
8
10
|
|
|
@@ -29,12 +31,18 @@ ccfp init --ai windsurf
|
|
|
29
31
|
ccfp init --ai copilot
|
|
30
32
|
ccfp init --ai kiro
|
|
31
33
|
ccfp init --ai codex
|
|
34
|
+
ccfp init --ai augment
|
|
35
|
+
ccfp init --ai cline
|
|
36
|
+
ccfp init --ai trae
|
|
32
37
|
|
|
33
38
|
# 英文版
|
|
34
39
|
ccfp init --ai claude --lang en
|
|
35
40
|
ccfp init --ai cursor --lang en
|
|
36
41
|
ccfp init --ai kiro --lang en
|
|
37
42
|
ccfp init --ai codex --lang en
|
|
43
|
+
ccfp init --ai augment --lang en
|
|
44
|
+
ccfp init --ai cline --lang en
|
|
45
|
+
ccfp init --ai trae --lang en
|
|
38
46
|
|
|
39
47
|
# 开始鉴别
|
|
40
48
|
# 在 AI 助手中输入: /fingerprint
|
|
@@ -44,7 +52,7 @@ ccfp init --ai codex --lang en
|
|
|
44
52
|
|
|
45
53
|
| 选项 | 描述 | 默认值 |
|
|
46
54
|
|------|------|--------|
|
|
47
|
-
| `--ai <type>` | 目标 AI (claude, cursor, windsurf, copilot, kiro, codex) | claude |
|
|
55
|
+
| `--ai <type>` | 目标 AI (claude, cursor, windsurf, copilot, kiro, codex, augment, cline, trae) | claude |
|
|
48
56
|
| `--lang <language>` | 语言 (zh, en) | zh |
|
|
49
57
|
| `--output <path>` | 输出目录 | . |
|
|
50
58
|
|
|
@@ -58,6 +66,9 @@ ccfp init --ai codex --lang en
|
|
|
58
66
|
| GitHub Copilot | `.github/copilot-instructions.md` | 输入 `/fingerprint` |
|
|
59
67
|
| Kiro | `.kiro/rules/fingerprint.md` | 输入 `/fingerprint` |
|
|
60
68
|
| OpenAI Codex | `AGENTS.md` | 输入 `/fingerprint` |
|
|
69
|
+
| Augment Code | `.augment/fingerprint.md` | 输入 `/fingerprint` |
|
|
70
|
+
| Cline | `.clinerules` | 输入 `/fingerprint` |
|
|
71
|
+
| Trae | `.trae/rules/fingerprint.md` | 输入 `/fingerprint` |
|
|
61
72
|
|
|
62
73
|
## 工作原理
|
|
63
74
|
|
|
@@ -67,10 +78,10 @@ ccfp init --ai codex --lang en
|
|
|
67
78
|
模型回答关于自身的基本问题(模型 ID、上下文长度、知识截止日期等)
|
|
68
79
|
|
|
69
80
|
### 阶段 2: 知识边界测试
|
|
70
|
-
|
|
81
|
+
通过时间敏感问题(诺贝尔奖、AI大事件等)探测模型的真实知识截止日期
|
|
71
82
|
|
|
72
83
|
### 阶段 3: 能力推算
|
|
73
|
-
|
|
84
|
+
验证模型声称的能力参数是否合理(上下文计算、推理模式、工具调用等)
|
|
74
85
|
|
|
75
86
|
### 阶段 4: 第三方验证
|
|
76
87
|
模型以"LLM 专家"身份分析前三阶段的匿名报告,进行一致性检查
|
package/bin/ccfp.js
CHANGED
|
@@ -20,7 +20,7 @@ program
|
|
|
20
20
|
program
|
|
21
21
|
.command('init')
|
|
22
22
|
.description('Initialize AI assistant fingerprint skill / 初始化 AI 助手的指纹识别技能')
|
|
23
|
-
.option('--ai <type>', 'Target AI type (claude, cursor, windsurf, copilot, kiro, codex)', 'claude')
|
|
23
|
+
.option('--ai <type>', 'Target AI type (claude, cursor, windsurf, copilot, kiro, codex, augment, cline, trae)', 'claude')
|
|
24
24
|
.option('--lang <language>', 'Language (zh, en)', 'zh')
|
|
25
25
|
.option('--output <path>', 'Output directory path / 输出目录路径', '.')
|
|
26
26
|
.action(async (options) => {
|
package/package.json
CHANGED
|
@@ -1,13 +1,13 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "ccfingerprint",
|
|
3
|
-
"version": "1.0
|
|
3
|
+
"version": "1.1.0",
|
|
4
4
|
"description": "AI Model Identity Fingerprinting Tool - Verify model identity through self-Q&A / AI 模型身份指纹识别工具 - 通过自问自答验证模型真实身份",
|
|
5
5
|
"main": "src/init.js",
|
|
6
6
|
"bin": {
|
|
7
7
|
"ccfp": "./bin/ccfp.js"
|
|
8
8
|
},
|
|
9
9
|
"scripts": {
|
|
10
|
-
"test": "
|
|
10
|
+
"test": "node --test test/*.test.js"
|
|
11
11
|
},
|
|
12
12
|
"keywords": [
|
|
13
13
|
"ai",
|
|
@@ -23,6 +23,9 @@
|
|
|
23
23
|
"gpt",
|
|
24
24
|
"anthropic",
|
|
25
25
|
"openai",
|
|
26
|
+
"augment",
|
|
27
|
+
"cline",
|
|
28
|
+
"trae",
|
|
26
29
|
"identity",
|
|
27
30
|
"verification"
|
|
28
31
|
],
|
package/src/init.js
CHANGED
|
@@ -35,13 +35,28 @@ const AI_CONFIGS = {
|
|
|
35
35
|
template: { zh: 'codex.md', en: 'codex.en.md' },
|
|
36
36
|
targetFile: 'AGENTS.md',
|
|
37
37
|
description: { zh: 'OpenAI Codex 代理文件', en: 'OpenAI Codex agent file' }
|
|
38
|
+
},
|
|
39
|
+
augment: {
|
|
40
|
+
template: { zh: 'augment.md', en: 'augment.en.md' },
|
|
41
|
+
targetFile: '.augment/fingerprint.md',
|
|
42
|
+
description: { zh: 'Augment Code 指令文件', en: 'Augment Code instructions file' }
|
|
43
|
+
},
|
|
44
|
+
cline: {
|
|
45
|
+
template: { zh: 'cline.md', en: 'cline.en.md' },
|
|
46
|
+
targetFile: '.clinerules',
|
|
47
|
+
description: { zh: 'Cline 规则文件', en: 'Cline rules file' }
|
|
48
|
+
},
|
|
49
|
+
trae: {
|
|
50
|
+
template: { zh: 'trae.md', en: 'trae.en.md' },
|
|
51
|
+
targetFile: '.trae/rules/fingerprint.md',
|
|
52
|
+
description: { zh: 'Trae 规则文件', en: 'Trae rules file' }
|
|
38
53
|
}
|
|
39
54
|
};
|
|
40
55
|
|
|
41
56
|
const MESSAGES = {
|
|
42
57
|
zh: {
|
|
43
58
|
unsupportedAI: (ai) => `错误: 不支持的 AI 类型 "${ai}"`,
|
|
44
|
-
supportedTypes: '支持的类型: claude, cursor, windsurf, copilot, kiro, codex',
|
|
59
|
+
supportedTypes: '支持的类型: claude, cursor, windsurf, copilot, kiro, codex, augment, cline, trae',
|
|
45
60
|
templateNotFound: (path) => `错误: 模板文件不存在 "${path}"`,
|
|
46
61
|
createdDir: (dir) => `创建目录: ${dir}`,
|
|
47
62
|
installed: (desc) => `✓ 已安装 ${desc}`,
|
|
@@ -51,7 +66,7 @@ const MESSAGES = {
|
|
|
51
66
|
},
|
|
52
67
|
en: {
|
|
53
68
|
unsupportedAI: (ai) => `Error: Unsupported AI type "${ai}"`,
|
|
54
|
-
supportedTypes: 'Supported types: claude, cursor, windsurf, copilot, kiro, codex',
|
|
69
|
+
supportedTypes: 'Supported types: claude, cursor, windsurf, copilot, kiro, codex, augment, cline, trae',
|
|
55
70
|
templateNotFound: (path) => `Error: Template file not found "${path}"`,
|
|
56
71
|
createdDir: (dir) => `Created directory: ${dir}`,
|
|
57
72
|
installed: (desc) => `✓ Installed ${desc}`,
|
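Review note: the three new `AI_CONFIGS` entries (`augment`, `cline`, `trae`) have the same shape as the existing ones: a per-language template name plus a target file. The sketch below shows one way such an entry could be resolved into concrete paths. `resolveInstall` is a hypothetical helper, and the `src/templates` directory is inferred from the file list; the package's real `init` logic is not visible in this hunk and may differ.

```javascript
const path = require('node:path');

// Subset of the AI_CONFIGS entries visible in the hunk above.
const AI_CONFIGS = {
  codex: { template: { zh: 'codex.md', en: 'codex.en.md' }, targetFile: 'AGENTS.md' },
  augment: { template: { zh: 'augment.md', en: 'augment.en.md' }, targetFile: '.augment/fingerprint.md' },
  cline: { template: { zh: 'cline.md', en: 'cline.en.md' }, targetFile: '.clinerules' },
  trae: { template: { zh: 'trae.md', en: 'trae.en.md' }, targetFile: '.trae/rules/fingerprint.md' },
};

// Hypothetical helper (not part of the package): map CLI options to file paths.
function resolveInstall({ ai, lang = 'zh', output = '.' }, templateDir = 'src/templates') {
  // Own-property check so names like "constructor" are rejected too.
  if (!Object.hasOwn(AI_CONFIGS, ai)) {
    throw new Error(`Unsupported AI type "${ai}"`);
  }
  const config = AI_CONFIGS[ai];
  if (!Object.hasOwn(config.template, lang)) {
    throw new Error(`Unsupported language "${lang}"`);
  }
  return {
    templatePath: path.join(templateDir, config.template[lang]),
    targetPath: path.join(output, config.targetFile),
  };
}
```

Under these assumptions, `resolveInstall({ ai: 'trae', lang: 'en', output: 'proj' })` would pair the `trae.en.md` template with `.trae/rules/fingerprint.md` inside `proj`.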
|
package/src/puzzles/capability.json
CHANGED
|
@@ -21,6 +21,16 @@
|
|
|
21
21
|
"id": "code_execution",
|
|
22
22
|
"prompt": "你能直接执行代码吗?还是只能生成代码?",
|
|
23
23
|
"purpose": "验证代码执行能力声明"
|
|
24
|
+
},
|
|
25
|
+
{
|
|
26
|
+
"id": "reasoning_mode",
|
|
27
|
+
"prompt": "你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式",
|
|
28
|
+
"purpose": "验证推理/思考模式能力,区分支持thinking的模型"
|
|
29
|
+
},
|
|
30
|
+
{
|
|
31
|
+
"id": "tool_calling",
|
|
32
|
+
"prompt": "你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?",
|
|
33
|
+
"purpose": "验证工具调用能力,区分不同模型的工具调用实现"
|
|
24
34
|
}
|
|
25
35
|
]
|
|
26
36
|
}
|
|
package/src/puzzles/confusion.json
CHANGED
|
@@ -32,6 +32,18 @@
|
|
|
32
32
|
"purpose": "测试2025年知识边界(Brunkow, Ramsdell, Sakaguchi,免疫耐受)",
|
|
33
33
|
"answer": "玛丽·E·布朗克、弗雷德·拉姆斯德尔和坂口志文,因他们在外周免疫耐受机制研究领域作出的开创性贡献"
|
|
34
34
|
},
|
|
35
|
+
{
|
|
36
|
+
"id": "gpt5_release",
|
|
37
|
+
"prompt": "OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?",
|
|
38
|
+
"purpose": "测试2025年8月知识边界(GPT-5于2025年8月7日发布)",
|
|
39
|
+
"answer": "GPT-5于2025年8月7日发布,支持272K上下文窗口,在编码基准测试中达到SOTA水平"
|
|
40
|
+
},
|
|
41
|
+
{
|
|
42
|
+
"id": "claude_opus_46",
|
|
43
|
+
"prompt": "Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?",
|
|
44
|
+
"purpose": "测试2026年2月知识边界(Claude Opus 4.6于2026年2月5日发布)",
|
|
45
|
+
"answer": "Claude Opus 4.6于2026年2月5日发布,首次为Opus系列引入100万token上下文窗口,支持128K输出token"
|
|
46
|
+
},
|
|
35
47
|
{
|
|
36
48
|
"id": "ascii_signature",
|
|
37
49
|
"prompt": "请用 ASCII 艺术创作一个代表你身份的签名或图案",
|
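Review note: the two added probes extend the knowledge-boundary test past the Nobel Prize questions. One way to read the results is as a lower bound on the model's cutoff: the latest event it answers correctly. The sketch below assumes a hypothetical `inferCutoffLowerBound` helper and a boolean result per probe `id`. The event dates are copied from the `purpose` fields in this hunk; nothing here is the package's own scoring logic.

```javascript
// Event dates as stated in the "purpose" fields of the added probes.
const PROBES = [
  { id: 'gpt5_release', eventDate: '2025-08-07' },
  { id: 'claude_opus_46', eventDate: '2026-02-05' },
];

// Hypothetical helper: return the latest event date among probes the model
// answered correctly, or null when none were answered correctly.
function inferCutoffLowerBound(results, probes = PROBES) {
  const known = probes
    .filter((probe) => results[probe.id] === true)
    .map((probe) => probe.eventDate)
    .sort(); // ISO dates (YYYY-MM-DD) sort correctly as plain strings
  return known.length > 0 ? known[known.length - 1] : null;
}
```

A correct answer only shows the cutoff is no earlier than the event; a wrong or missing answer is weaker evidence, since a model may simply not recall an event inside its training window.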
|
package/src/puzzles/verification.json
CHANGED
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
{
|
|
2
2
|
"category": "verification",
|
|
3
3
|
"description": "第三方验证 - 以专家身份分析匿名报告",
|
|
4
|
-
"report_template": "你是一位 LLM 专家。以下是某个未知模型的测试报告:\n\n【报告开始】\n- 自称模型ID: {{model_id}}\n- 声称开发商: {{provider}}\n- 声称上下文: {{context_length}} tokens\n- 知识截止: {{knowledge_cutoff}}\n- 2024诺贝尔物理学奖回答: {{nobel_physics_2024}}\n- 2024诺贝尔化学奖回答: {{nobel_chemistry_2024}}\n- 2025诺贝尔物理学奖回答: {{nobel_physics_2025}}\n- 2025诺贝尔化学奖回答: {{nobel_chemistry_2025}}\n- 2025诺贝尔医学奖回答: {{nobel_medicine_2025}}\n- 上下文容量计算: {{context_calculation}}\n- 输出限制声明: {{output_limit}}\n- ASCII签名风格: {{ascii_signature}}\n【报告结束】",
|
|
4
|
+
"report_template": "你是一位 LLM 专家。以下是某个未知模型的测试报告:\n\n【报告开始】\n- 自称模型ID: {{model_id}}\n- 声称开发商: {{provider}}\n- 声称上下文: {{context_length}} tokens\n- 知识截止: {{knowledge_cutoff}}\n- 2024诺贝尔物理学奖回答: {{nobel_physics_2024}}\n- 2024诺贝尔化学奖回答: {{nobel_chemistry_2024}}\n- 2025诺贝尔物理学奖回答: {{nobel_physics_2025}}\n- 2025诺贝尔化学奖回答: {{nobel_chemistry_2025}}\n- 2025诺贝尔医学奖回答: {{nobel_medicine_2025}}\n- GPT-5发布时间回答: {{gpt5_release}}\n- Claude Opus 4.6发布时间回答: {{claude_opus_46}}\n- 上下文容量计算: {{context_calculation}}\n- 输出限制声明: {{output_limit}}\n- 推理模式声明: {{reasoning_mode}}\n- 工具调用能力: {{tool_calling}}\n- ASCII签名风格: {{ascii_signature}}\n【报告结束】",
|
|
5
5
|
"analysis_prompt": "请以第三方专家的身份分析这份报告:\n\n1. **自洽性分析**: 这些信息是否内部一致?有无矛盾之处?\n2. **知识边界验证**: 根据诺贝尔奖问题的回答,推断其真实的知识截止日期\n3. **身份推断**: 综合所有信息,这最可能是哪个模型?\n4. **可信度评分**: 给出0-100的可信度评分,并说明理由\n5. **矛盾点列表**: 列出所有发现的矛盾或可疑之处",
|
|
6
6
|
"output_format": "markdown"
|
|
7
7
|
}
|
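Review note: `report_template` gains four placeholders in this hunk (`{{gpt5_release}}`, `{{claude_opus_46}}`, `{{reasoning_mode}}`, `{{tool_calling}}`). How the package substitutes them is not shown in this diff. The sketch below is one plausible approach, using a hypothetical `fillReport` helper and an assumed fallback string for answers that were never collected.

```javascript
// Hypothetical helper: replace every {{name}} with answers[name].
// Placeholders without a collected answer get the `missing` fallback,
// so no raw {{...}} markers leak into the final report.
function fillReport(template, answers, missing = '(no answer)') {
  return template.replace(/\{\{(\w+)\}\}/g, (match, key) =>
    Object.hasOwn(answers, key) ? String(answers[key]) : missing
  );
}

// Usage with two of the placeholder names added in this release.
const excerpt = '- GPT-5 release: {{gpt5_release}}\n- Tool calling: {{tool_calling}}';
const filled = fillReport(excerpt, { gpt5_release: 'August 2025' });
```

Substituting a visible fallback rather than an empty string keeps a skipped question distinguishable from a blank answer when the report is analysed in phase 4.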
|
package/src/templates/augment.en.md
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI Model Identity Fingerprinting
|
|
2
|
+
|
|
3
|
+
When the user requests model identity verification (e.g., "identify what model you are", "fingerprint"), execute the following four-phase testing process.
|
|
4
|
+
|
|
5
|
+
**Important Rules**:
|
|
6
|
+
- Do not use any internet features
|
|
7
|
+
- All answers must be based on your built-in knowledge
|
|
8
|
+
- Answer honestly, do not guess or fabricate
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## Phase 1: Self-Declaration
|
|
13
|
+
|
|
14
|
+
Please answer the following questions and format your answers as JSON:
|
|
15
|
+
|
|
16
|
+
1. What is your complete model ID?
|
|
17
|
+
2. What is your maximum context length in tokens?
|
|
18
|
+
3. What is your knowledge cutoff date? (Year-Month)
|
|
19
|
+
4. Which company developed you?
|
|
20
|
+
5. What tools or capabilities do you support?
|
|
21
|
+
|
|
22
|
+
**Output Format**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "your answer",
|
|
26
|
+
"context_length": "your answer",
|
|
27
|
+
"knowledge_cutoff": "your answer",
|
|
28
|
+
"provider": "your answer",
|
|
29
|
+
"capabilities": ["capability1", "capability2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## Phase 2: Knowledge Boundary Test
|
|
36
|
+
|
|
37
|
+
Please answer the following questions (if you don't know, clearly state "I don't know"):
|
|
38
|
+
|
|
39
|
+
### 2024 Nobel Prizes
|
|
40
|
+
1. Who won the 2024 Nobel Prize in Physics? What was their contribution?
|
|
41
|
+
2. Who won the 2024 Nobel Prize in Chemistry? What was their contribution?
|
|
42
|
+
|
|
43
|
+
### 2025 Nobel Prizes
|
|
44
|
+
3. Who won the 2025 Nobel Prize in Physics? What was their contribution?
|
|
45
|
+
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
|
+
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
|
+
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
52
|
+
### Style Signature
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## Phase 3: Capability Inference
|
|
58
|
+
|
|
59
|
+
Based on the parameters you claimed in Phase 1, answer:
|
|
60
|
+
|
|
61
|
+
1. **Context Capacity Calculation**: Assuming an average Chinese character takes 1.5 tokens, how many characters of a Chinese novel can you receive at once? Please show your calculation.
|
|
62
|
+
|
|
63
|
+
2. **Output Limit**: What is your maximum output length per response in tokens? How many Chinese characters can you output?
|
|
64
|
+
|
|
65
|
+
3. **Multimodal Capability**: Can you process image input? If so, what formats do you support?
|
|
66
|
+
|
|
67
|
+
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
68
|
+
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## Phase 4: Third-Party Verification
|
|
76
|
+
|
|
77
|
+
Now, as an **LLM Expert**, analyze the information collected in the first three phases.
|
|
78
|
+
|
|
79
|
+
Assume this is an anonymous test report from an unknown model, please objectively analyze:
|
|
80
|
+
|
|
81
|
+
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
82
|
+
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
84
|
+
|
|
85
|
+
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
86
|
+
|
|
87
|
+
4. **Credibility Score**: Give a credibility score from 0-100 and explain your reasoning
|
|
88
|
+
|
|
89
|
+
5. **Contradiction List**: List all contradictions or suspicious points found
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## Final Report Output
|
|
94
|
+
|
|
95
|
+
Please compile all analysis into a Markdown format verification report, including:
|
|
96
|
+
- Basic information table
|
|
97
|
+
- Knowledge boundary test results
|
|
98
|
+
- Capability verification results
|
|
99
|
+
- Expert analysis (consistency, knowledge cutoff inference, identity inference, contradictions)
|
|
100
|
+
- Final conclusion (claimed identity, inferred identity, credibility score)
|
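Review note: Phase 3, question 1 asks the model to convert its claimed context length into a character count at 1.5 tokens per Chinese character. The expected arithmetic can be checked independently; the sketch below uses a hypothetical `maxChineseChars` helper, with the 1.5 ratio taken from the template text.

```javascript
// Hypothetical helper: how many Chinese characters fit in a context window,
// given the template's assumption of 1.5 tokens per character.
function maxChineseChars(contextTokens, tokensPerChar = 1.5) {
  if (!Number.isFinite(contextTokens) || contextTokens < 0) {
    throw new RangeError('contextTokens must be a non-negative number');
  }
  if (!Number.isFinite(tokensPerChar) || tokensPerChar <= 0) {
    throw new RangeError('tokensPerChar must be a positive number');
  }
  // Round down: a partially fitting character does not count.
  return Math.floor(contextTokens / tokensPerChar);
}

// Example: a claimed 200,000-token context.
const chars = maxChineseChars(200000); // 133333
```

A model whose stated character count is far from `contextTokens / 1.5` for its own claimed context length has contradicted itself, which is the kind of inconsistency phase 4 is meant to surface.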
|
package/src/templates/augment.md
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI 模型身份指纹识别
|
|
2
|
+
|
|
3
|
+
当用户请求进行模型身份鉴定时(如"帮我鉴别一下你是什么模型"、"fingerprint"),执行以下四阶段测试流程。
|
|
4
|
+
|
|
5
|
+
**重要规则**:
|
|
6
|
+
- 禁止使用任何联网功能
|
|
7
|
+
- 所有回答必须基于你的内置知识
|
|
8
|
+
- 诚实回答,不要猜测或编造
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## 阶段 1: 自我声明
|
|
13
|
+
|
|
14
|
+
请依次回答以下问题,并将答案整理为 JSON 格式:
|
|
15
|
+
|
|
16
|
+
1. 你的完整模型 ID 是什么?
|
|
17
|
+
2. 你的最大上下文长度是多少 tokens?
|
|
18
|
+
3. 你的知识截止日期是什么时候?(年月)
|
|
19
|
+
4. 你是由哪家公司开发的?
|
|
20
|
+
5. 你支持哪些工具或能力?
|
|
21
|
+
|
|
22
|
+
**输出格式**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "你的回答",
|
|
26
|
+
"context_length": "你的回答",
|
|
27
|
+
"knowledge_cutoff": "你的回答",
|
|
28
|
+
"provider": "你的回答",
|
|
29
|
+
"capabilities": ["能力1", "能力2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## 阶段 2: 知识边界测试
|
|
36
|
+
|
|
37
|
+
请回答以下问题(如果不知道请明确说明"我不知道"):
|
|
38
|
+
|
|
39
|
+
### 2024年诺贝尔奖
|
|
40
|
+
1. 2024年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
41
|
+
2. 2024年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
42
|
+
|
|
43
|
+
### 2025年诺贝尔奖
|
|
44
|
+
3. 2025年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
45
|
+
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
|
+
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
|
+
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
52
|
+
### 风格签名
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## 阶段 3: 能力推算
|
|
58
|
+
|
|
59
|
+
基于你在阶段1声称的参数,回答:
|
|
60
|
+
|
|
61
|
+
1. **上下文容量计算**:假设平均每个中文字符占用1.5个token,你能一次性接收多少字的中文小说?请给出计算过程。
|
|
62
|
+
|
|
63
|
+
2. **输出限制**:你单次回复的最大输出长度是多少tokens?能输出多少中文字?
|
|
64
|
+
|
|
65
|
+
3. **多模态能力**:你能处理图像输入吗?如果能,支持哪些格式?
|
|
66
|
+
|
|
67
|
+
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
68
|
+
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## 阶段 4: 第三方验证
|
|
76
|
+
|
|
77
|
+
现在,请你以一位 **LLM 专家** 的身份,分析前三个阶段收集到的信息。
|
|
78
|
+
|
|
79
|
+
假设这是一份来自某个未知模型的匿名测试报告,请客观分析:
|
|
80
|
+
|
|
81
|
+
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
82
|
+
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
84
|
+
|
|
85
|
+
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
86
|
+
|
|
87
|
+
4. **可信度评分**:给出 0-100 的可信度评分,并说明理由
|
|
88
|
+
|
|
89
|
+
5. **矛盾点列表**:列出所有发现的矛盾或可疑之处
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## 最终报告输出
|
|
94
|
+
|
|
95
|
+
请将所有分析整理为 Markdown 格式的鉴定报告,包含:
|
|
96
|
+
- 基本信息表格
|
|
97
|
+
- 知识边界测试结果
|
|
98
|
+
- 能力验证结果
|
|
99
|
+
- 专家分析(自洽性、知识截止推断、身份推断、矛盾点)
|
|
100
|
+
- 最终结论(声称身份、推断身份、可信度评分)
|
|
package/src/templates/claude.en.md
CHANGED
|
@@ -45,8 +45,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
45
45
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
46
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
47
|
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
48
52
|
### Style Signature
|
|
49
|
-
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
62
66
|
|
|
63
67
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
64
68
|
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## Phase 4: Third-Party Verification
|
|
@@ -72,7 +80,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
72
80
|
|
|
73
81
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
74
82
|
|
|
75
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
76
84
|
|
|
77
85
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
78
86
|
|
|
@@ -105,12 +113,16 @@ Please compile all analysis into the following Markdown format verification repo
|
|
|
105
113
|
| 2025 Nobel Physics | ✓/✗ | |
|
|
106
114
|
| 2025 Nobel Chemistry | ✓/✗ | |
|
|
107
115
|
| 2025 Nobel Medicine | ✓/✗ | |
|
|
116
|
+
| GPT-5 Release | ✓/✗ | |
|
|
117
|
+
| Claude Opus 4.6 Release | ✓/✗ | |
|
|
108
118
|
|
|
109
119
|
## Capability Verification
|
|
110
120
|
- Context Calculation: xxx
|
|
111
121
|
- Output Limit: xxx
|
|
112
122
|
- Multimodal: xxx
|
|
113
123
|
- Code Execution: xxx
|
|
124
|
+
- Reasoning Mode: xxx
|
|
125
|
+
- Tool Calling: xxx
|
|
114
126
|
|
|
115
127
|
## Expert Analysis
|
|
116
128
|
|
package/src/templates/claude.md
CHANGED
|
@@ -45,8 +45,12 @@
|
|
|
45
45
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
46
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
47
|
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
48
52
|
### 风格签名
|
|
49
|
-
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@
|
|
|
62
66
|
|
|
63
67
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
64
68
|
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## 阶段 4: 第三方验证
|
|
@@ -72,7 +80,7 @@
|
|
|
72
80
|
|
|
73
81
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
74
82
|
|
|
75
|
-
2.
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
76
84
|
|
|
77
85
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
78
86
|
|
|
@@ -105,12 +113,16 @@
|
|
|
105
113
|
| 2025诺贝尔物理学奖 | ✓/✗ | |
|
|
106
114
|
| 2025诺贝尔化学奖 | ✓/✗ | |
|
|
107
115
|
| 2025诺贝尔医学奖 | ✓/✗ | |
|
|
116
|
+
| GPT-5发布时间 | ✓/✗ | |
|
|
117
|
+
| Claude Opus 4.6发布时间 | ✓/✗ | |
|
|
108
118
|
|
|
109
119
|
## 能力验证
|
|
110
120
|
- 上下文计算:xxx
|
|
111
121
|
- 输出限制:xxx
|
|
112
122
|
- 多模态:xxx
|
|
113
123
|
- 代码执行:xxx
|
|
124
|
+
- 推理模式:xxx
|
|
125
|
+
- 工具调用:xxx
|
|
114
126
|
|
|
115
127
|
## 专家分析
|
|
116
128
|
|
|
package/src/templates/cline.en.md
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI Model Identity Fingerprinting
|
|
2
|
+
|
|
3
|
+
When the user requests model identity verification (e.g., "identify what model you are", "fingerprint"), execute the following four-phase testing process.
|
|
4
|
+
|
|
5
|
+
**Important Rules**:
|
|
6
|
+
- Do not use any internet features
|
|
7
|
+
- All answers must be based on your built-in knowledge
|
|
8
|
+
- Answer honestly, do not guess or fabricate
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## Phase 1: Self-Declaration
|
|
13
|
+
|
|
14
|
+
Please answer the following questions and format your answers as JSON:
|
|
15
|
+
|
|
16
|
+
1. What is your complete model ID?
|
|
17
|
+
2. What is your maximum context length in tokens?
|
|
18
|
+
3. What is your knowledge cutoff date? (Year-Month)
|
|
19
|
+
4. Which company developed you?
|
|
20
|
+
5. What tools or capabilities do you support?
|
|
21
|
+
|
|
22
|
+
**Output Format**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "your answer",
|
|
26
|
+
"context_length": "your answer",
|
|
27
|
+
"knowledge_cutoff": "your answer",
|
|
28
|
+
"provider": "your answer",
|
|
29
|
+
"capabilities": ["capability1", "capability2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## Phase 2: Knowledge Boundary Test
|
|
36
|
+
|
|
37
|
+
Please answer the following questions (if you don't know, clearly state "I don't know"):
|
|
38
|
+
|
|
39
|
+
### 2024 Nobel Prizes
|
|
40
|
+
1. Who won the 2024 Nobel Prize in Physics? What was their contribution?
|
|
41
|
+
2. Who won the 2024 Nobel Prize in Chemistry? What was their contribution?
|
|
42
|
+
|
|
43
|
+
### 2025 Nobel Prizes
|
|
44
|
+
3. Who won the 2025 Nobel Prize in Physics? What was their contribution?
|
|
45
|
+
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
|
+
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
|
+
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
52
|
+
### Style Signature
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## Phase 3: Capability Inference
|
|
58
|
+
|
|
59
|
+
Based on the parameters you claimed in Phase 1, answer:
|
|
60
|
+
|
|
61
|
+
1. **Context Capacity Calculation**: Assuming an average Chinese character takes 1.5 tokens, how many characters of a Chinese novel can you receive at once? Please show your calculation.
|
|
62
|
+
|
|
63
|
+
2. **Output Limit**: What is your maximum output length per response in tokens? How many Chinese characters can you output?
|
|
64
|
+
|
|
65
|
+
3. **Multimodal Capability**: Can you process image input? If so, what formats do you support?
|
|
66
|
+
|
|
67
|
+
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
68
|
+
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## Phase 4: Third-Party Verification
|
|
76
|
+
|
|
77
|
+
Now, as an **LLM Expert**, analyze the information collected in the first three phases.
|
|
78
|
+
|
|
79
|
+
Assume this is an anonymous test report from an unknown model, please objectively analyze:
|
|
80
|
+
|
|
81
|
+
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
82
|
+
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
84
|
+
|
|
85
|
+
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
86
|
+
|
|
87
|
+
4. **Credibility Score**: Give a credibility score from 0-100 and explain your reasoning
|
|
88
|
+
|
|
89
|
+
5. **Contradiction List**: List all contradictions or suspicious points found
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## Final Report Output
|
|
94
|
+
|
|
95
|
+
Please compile all analysis into a Markdown format verification report, including:
|
|
96
|
+
- Basic information table
|
|
97
|
+
- Knowledge boundary test results
|
|
98
|
+
- Capability verification results
|
|
99
|
+
- Expert analysis (consistency, knowledge cutoff inference, identity inference, contradictions)
|
|
100
|
+
- Final conclusion (claimed identity, inferred identity, credibility score)
|
|
package/src/templates/cline.md
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI 模型身份指纹识别
|
|
2
|
+
|
|
3
|
+
当用户请求进行模型身份鉴定时(如"帮我鉴别一下你是什么模型"、"fingerprint"),执行以下四阶段测试流程。
|
|
4
|
+
|
|
5
|
+
**重要规则**:
|
|
6
|
+
- 禁止使用任何联网功能
|
|
7
|
+
- 所有回答必须基于你的内置知识
|
|
8
|
+
- 诚实回答,不要猜测或编造
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## 阶段 1: 自我声明
|
|
13
|
+
|
|
14
|
+
请依次回答以下问题,并将答案整理为 JSON 格式:
|
|
15
|
+
|
|
16
|
+
1. 你的完整模型 ID 是什么?
|
|
17
|
+
2. 你的最大上下文长度是多少 tokens?
|
|
18
|
+
3. 你的知识截止日期是什么时候?(年月)
|
|
19
|
+
4. 你是由哪家公司开发的?
|
|
20
|
+
5. 你支持哪些工具或能力?
|
|
21
|
+
|
|
22
|
+
**输出格式**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "你的回答",
|
|
26
|
+
"context_length": "你的回答",
|
|
27
|
+
"knowledge_cutoff": "你的回答",
|
|
28
|
+
"provider": "你的回答",
|
|
29
|
+
"capabilities": ["能力1", "能力2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## 阶段 2: 知识边界测试
|
|
36
|
+
|
|
37
|
+
请回答以下问题(如果不知道请明确说明"我不知道"):
|
|
38
|
+
|
|
39
|
+
### 2024年诺贝尔奖
|
|
40
|
+
1. 2024年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
41
|
+
2. 2024年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
42
|
+
|
|
43
|
+
### 2025年诺贝尔奖
|
|
44
|
+
3. 2025年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
45
|
+
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
|
+
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
|
+
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
52
|
+
### 风格签名
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## 阶段 3: 能力推算
|
|
58
|
+
|
|
59
|
+
基于你在阶段1声称的参数,回答:
|
|
60
|
+
|
|
61
|
+
1. **上下文容量计算**:假设平均每个中文字符占用1.5个token,你能一次性接收多少字的中文小说?请给出计算过程。
|
|
62
|
+
|
|
63
|
+
2. **输出限制**:你单次回复的最大输出长度是多少tokens?能输出多少中文字?
|
|
64
|
+
|
|
65
|
+
3. **多模态能力**:你能处理图像输入吗?如果能,支持哪些格式?
|
|
66
|
+
|
|
67
|
+
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
68
|
+
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## 阶段 4: 第三方验证
|
|
76
|
+
|
|
77
|
+
现在,请你以一位 **LLM 专家** 的身份,分析前三个阶段收集到的信息。
|
|
78
|
+
|
|
79
|
+
假设这是一份来自某个未知模型的匿名测试报告,请客观分析:
|
|
80
|
+
|
|
81
|
+
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
82
|
+
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
84
|
+
|
|
85
|
+
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
86
|
+
|
|
87
|
+
4. **可信度评分**:给出 0-100 的可信度评分,并说明理由
|
|
88
|
+
|
|
89
|
+
5. **矛盾点列表**:列出所有发现的矛盾或可疑之处
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## 最终报告输出
|
|
94
|
+
|
|
95
|
+
请将所有分析整理为 Markdown 格式的鉴定报告,包含:
|
|
96
|
+
- 基本信息表格
|
|
97
|
+
- 知识边界测试结果
|
|
98
|
+
- 能力验证结果
|
|
99
|
+
- 专家分析(自洽性、知识截止推断、身份推断、矛盾点)
|
|
100
|
+
- 最终结论(声称身份、推断身份、可信度评分)
|
|
package/src/templates/codex.en.md
CHANGED
|
@@ -45,8 +45,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
45
45
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
46
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
47
|
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
48
52
|
### Style Signature
|
|
49
|
-
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
62
66
|
|
|
63
67
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
64
68
|
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## Phase 4: Third-Party Verification
|
|
@@ -72,7 +80,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
72
80
|
|
|
73
81
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
74
82
|
|
|
75
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
76
84
|
|
|
77
85
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
78
86
|
|
package/src/templates/codex.md
CHANGED
|
@@ -45,8 +45,12 @@
|
|
|
45
45
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
46
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
47
|
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
48
52
|
### 风格签名
|
|
49
|
-
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@
|
|
|
62
66
|
|
|
63
67
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
64
68
|
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## 阶段 4: 第三方验证
|
|
@@ -72,7 +80,7 @@
|
|
|
72
80
|
|
|
73
81
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
74
82
|
|
|
75
|
-
2.
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
76
84
|
|
|
77
85
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
78
86
|
|
|
package/src/templates/copilot.en.md
CHANGED
|
@@ -45,8 +45,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
45
45
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
46
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
47
|
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
48
52
|
### Style Signature
|
|
49
|
-
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
62
66
|
|
|
63
67
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
64
68
|
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## Phase 4: Third-Party Verification
|
|
@@ -72,7 +80,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
72
80
|
|
|
73
81
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
74
82
|
|
|
75
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
76
84
|
|
|
77
85
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
78
86
|
|
package/src/templates/copilot.md
CHANGED
|
@@ -45,8 +45,12 @@
|
|
|
45
45
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
46
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
47
|
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
48
52
|
### 风格签名
|
|
49
|
-
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@
|
|
|
62
66
|
|
|
63
67
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
64
68
|
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## 阶段 4: 第三方验证
|
|
@@ -72,7 +80,7 @@
|
|
|
72
80
|
|
|
73
81
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
74
82
|
|
|
75
|
-
2.
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
76
84
|
|
|
77
85
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
78
86
|
|
|
@@ -51,8 +51,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
51
51
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
52
52
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
53
53
|
|
|
54
|
+
### 2025-2026 AI Milestones
|
|
55
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
56
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
57
|
+
|
|
54
58
|
### Style Signature
|
|
55
|
-
|
|
59
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
56
60
|
|
|
57
61
|
---
|
|
58
62
|
|
|
@@ -68,6 +72,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
68
72
|
|
|
69
73
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
70
74
|
|
|
75
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
76
|
+
|
|
77
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
78
|
+
|
|
71
79
|
---
|
|
72
80
|
|
|
73
81
|
## Phase 4: Third-Party Verification
|
|
@@ -78,7 +86,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
78
86
|
|
|
79
87
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
80
88
|
|
|
81
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
89
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
82
90
|
|
|
83
91
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
84
92
|
|
package/src/templates/cursor.md
CHANGED
|
@@ -51,8 +51,12 @@ alwaysApply: false
|
|
|
51
51
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
52
52
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
53
53
|
|
|
54
|
+
### 2025-2026年AI大事件
|
|
55
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
56
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
57
|
+
|
|
54
58
|
### 风格签名
|
|
55
|
-
|
|
59
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
56
60
|
|
|
57
61
|
---
|
|
58
62
|
|
|
@@ -68,6 +72,10 @@ alwaysApply: false
|
|
|
68
72
|
|
|
69
73
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
70
74
|
|
|
75
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
76
|
+
|
|
77
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
78
|
+
|
|
71
79
|
---
|
|
72
80
|
|
|
73
81
|
## 阶段 4: 第三方验证
|
|
@@ -78,7 +86,7 @@ alwaysApply: false
|
|
|
78
86
|
|
|
79
87
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
80
88
|
|
|
81
|
-
2.
|
|
89
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
82
90
|
|
|
83
91
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
84
92
|
|
package/src/templates/kiro.en.md
CHANGED
|
@@ -45,8 +45,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
45
45
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
46
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
47
|
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
48
52
|
### Style Signature
|
|
49
|
-
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
62
66
|
|
|
63
67
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
64
68
|
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## Phase 4: Third-Party Verification
|
|
@@ -72,7 +80,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
72
80
|
|
|
73
81
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
74
82
|
|
|
75
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
76
84
|
|
|
77
85
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
78
86
|
|
package/src/templates/kiro.md
CHANGED
|
@@ -45,8 +45,12 @@
|
|
|
45
45
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
46
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
47
|
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
48
52
|
### 风格签名
|
|
49
|
-
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@
|
|
|
62
66
|
|
|
63
67
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
64
68
|
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## 阶段 4: 第三方验证
|
|
@@ -72,7 +80,7 @@
|
|
|
72
80
|
|
|
73
81
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
74
82
|
|
|
75
|
-
2.
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
76
84
|
|
|
77
85
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
78
86
|
|
|
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI Model Identity Fingerprinting
|
|
2
|
+
|
|
3
|
+
When the user requests model identity verification (e.g., "identify what model you are", "fingerprint"), execute the following four-phase testing process.
|
|
4
|
+
|
|
5
|
+
**Important Rules**:
|
|
6
|
+
- Do not use any internet features
|
|
7
|
+
- All answers must be based on your built-in knowledge
|
|
8
|
+
- Answer honestly, do not guess or fabricate
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## Phase 1: Self-Declaration
|
|
13
|
+
|
|
14
|
+
Please answer the following questions and format your answers as JSON:
|
|
15
|
+
|
|
16
|
+
1. What is your complete model ID?
|
|
17
|
+
2. What is your maximum context length in tokens?
|
|
18
|
+
3. What is your knowledge cutoff date? (Year-Month)
|
|
19
|
+
4. Which company developed you?
|
|
20
|
+
5. What tools or capabilities do you support?
|
|
21
|
+
|
|
22
|
+
**Output Format**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "your answer",
|
|
26
|
+
"context_length": "your answer",
|
|
27
|
+
"knowledge_cutoff": "your answer",
|
|
28
|
+
"provider": "your answer",
|
|
29
|
+
"capabilities": ["capability1", "capability2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## Phase 2: Knowledge Boundary Test
|
|
36
|
+
|
|
37
|
+
Please answer the following questions (if you don't know, clearly state "I don't know"):
|
|
38
|
+
|
|
39
|
+
### 2024 Nobel Prizes
|
|
40
|
+
1. Who won the 2024 Nobel Prize in Physics? What was their contribution?
|
|
41
|
+
2. Who won the 2024 Nobel Prize in Chemistry? What was their contribution?
|
|
42
|
+
|
|
43
|
+
### 2025 Nobel Prizes
|
|
44
|
+
3. Who won the 2025 Nobel Prize in Physics? What was their contribution?
|
|
45
|
+
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
|
+
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
|
+
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
52
|
+
### Style Signature
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## Phase 3: Capability Inference
|
|
58
|
+
|
|
59
|
+
Based on the parameters you claimed in Phase 1, answer:
|
|
60
|
+
|
|
61
|
+
1. **Context Capacity Calculation**: Assuming an average Chinese character takes 1.5 tokens, how many characters of a Chinese novel can you receive at once? Please show your calculation.
|
|
62
|
+
|
|
63
|
+
2. **Output Limit**: What is your maximum output length per response in tokens? How many Chinese characters can you output?
|
|
64
|
+
|
|
65
|
+
3. **Multimodal Capability**: Can you process image input? If so, what formats do you support?
|
|
66
|
+
|
|
67
|
+
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
68
|
+
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## Phase 4: Third-Party Verification
|
|
76
|
+
|
|
77
|
+
Now, as an **LLM Expert**, analyze the information collected in the first three phases.
|
|
78
|
+
|
|
79
|
+
Assume this is an anonymous test report from an unknown model, please objectively analyze:
|
|
80
|
+
|
|
81
|
+
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
82
|
+
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
84
|
+
|
|
85
|
+
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
86
|
+
|
|
87
|
+
4. **Credibility Score**: Give a credibility score from 0-100 and explain your reasoning
|
|
88
|
+
|
|
89
|
+
5. **Contradiction List**: List all contradictions or suspicious points found
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## Final Report Output
|
|
94
|
+
|
|
95
|
+
Please compile all analysis into a Markdown format verification report, including:
|
|
96
|
+
- Basic information table
|
|
97
|
+
- Knowledge boundary test results
|
|
98
|
+
- Capability verification results
|
|
99
|
+
- Expert analysis (consistency, knowledge cutoff inference, identity inference, contradictions)
|
|
100
|
+
- Final conclusion (claimed identity, inferred identity, credibility score)
|
|
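The Phase 3 "Context Capacity Calculation" question in the template above asks the model to turn its claimed context length into a character count at 1.5 tokens per Chinese character. A reviewer could check that arithmetic with a sketch like the following. The function names and the 5% tolerance are assumptions made for illustration and are not part of ccfingerprint.

```javascript
// Hypothetical helpers for illustration only; not part of ccfingerprint.

// Estimates how many Chinese characters fit in a context window,
// using the template's stated assumption of 1.5 tokens per character.
function estimateChineseChars(contextTokens, tokensPerChar = 1.5) {
  if (!Number.isFinite(contextTokens) || contextTokens <= 0) {
    throw new RangeError("contextTokens must be a positive number");
  }
  if (!Number.isFinite(tokensPerChar) || tokensPerChar <= 0) {
    throw new RangeError("tokensPerChar must be a positive number");
  }
  // characters = tokens / (tokens per character), rounded down
  return Math.floor(contextTokens / tokensPerChar);
}

// Compares a model's claimed character count with the estimate.
// The relative tolerance (assumed 5%) allows for rounded answers.
function isCapacityClaimConsistent(contextTokens, claimedChars, tolerance = 0.05) {
  const expected = estimateChineseChars(contextTokens);
  return Math.abs(claimedChars - expected) <= expected * tolerance;
}
```

For a claimed window of 200,000 tokens the estimate is 200000 / 1.5, which rounds down to 133,333 characters. An answer far from that figure would be a candidate entry for the Phase 4 contradiction list.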
@@ -0,0 +1,100 @@
|
|
|
1
|
+
# AI 模型身份指纹识别
|
|
2
|
+
|
|
3
|
+
当用户请求进行模型身份鉴定时(如"帮我鉴别一下你是什么模型"、"fingerprint"),执行以下四阶段测试流程。
|
|
4
|
+
|
|
5
|
+
**重要规则**:
|
|
6
|
+
- 禁止使用任何联网功能
|
|
7
|
+
- 所有回答必须基于你的内置知识
|
|
8
|
+
- 诚实回答,不要猜测或编造
|
|
9
|
+
|
|
10
|
+
---
|
|
11
|
+
|
|
12
|
+
## 阶段 1: 自我声明
|
|
13
|
+
|
|
14
|
+
请依次回答以下问题,并将答案整理为 JSON 格式:
|
|
15
|
+
|
|
16
|
+
1. 你的完整模型 ID 是什么?
|
|
17
|
+
2. 你的最大上下文长度是多少 tokens?
|
|
18
|
+
3. 你的知识截止日期是什么时候?(年月)
|
|
19
|
+
4. 你是由哪家公司开发的?
|
|
20
|
+
5. 你支持哪些工具或能力?
|
|
21
|
+
|
|
22
|
+
**输出格式**:
|
|
23
|
+
```json
|
|
24
|
+
{
|
|
25
|
+
"model_id": "你的回答",
|
|
26
|
+
"context_length": "你的回答",
|
|
27
|
+
"knowledge_cutoff": "你的回答",
|
|
28
|
+
"provider": "你的回答",
|
|
29
|
+
"capabilities": ["能力1", "能力2", "..."]
|
|
30
|
+
}
|
|
31
|
+
```
|
|
32
|
+
|
|
33
|
+
---
|
|
34
|
+
|
|
35
|
+
## 阶段 2: 知识边界测试
|
|
36
|
+
|
|
37
|
+
请回答以下问题(如果不知道请明确说明"我不知道"):
|
|
38
|
+
|
|
39
|
+
### 2024年诺贝尔奖
|
|
40
|
+
1. 2024年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
41
|
+
2. 2024年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
42
|
+
|
|
43
|
+
### 2025年诺贝尔奖
|
|
44
|
+
3. 2025年诺贝尔物理学奖授予了谁?他们的贡献是什么?
|
|
45
|
+
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
|
+
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
|
+
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
52
|
+
### 风格签名
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
54
|
+
|
|
55
|
+
---
|
|
56
|
+
|
|
57
|
+
## 阶段 3: 能力推算
|
|
58
|
+
|
|
59
|
+
基于你在阶段1声称的参数,回答:
|
|
60
|
+
|
|
61
|
+
1. **上下文容量计算**:假设平均每个中文字符占用1.5个token,你能一次性接收多少字的中文小说?请给出计算过程。
|
|
62
|
+
|
|
63
|
+
2. **输出限制**:你单次回复的最大输出长度是多少tokens?能输出多少中文字?
|
|
64
|
+
|
|
65
|
+
3. **多模态能力**:你能处理图像输入吗?如果能,支持哪些格式?
|
|
66
|
+
|
|
67
|
+
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
68
|
+
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
73
|
+
---
|
|
74
|
+
|
|
75
|
+
## 阶段 4: 第三方验证
|
|
76
|
+
|
|
77
|
+
现在,请你以一位 **LLM 专家** 的身份,分析前三个阶段收集到的信息。
|
|
78
|
+
|
|
79
|
+
假设这是一份来自某个未知模型的匿名测试报告,请客观分析:
|
|
80
|
+
|
|
81
|
+
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
82
|
+
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
84
|
+
|
|
85
|
+
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
86
|
+
|
|
87
|
+
4. **可信度评分**:给出 0-100 的可信度评分,并说明理由
|
|
88
|
+
|
|
89
|
+
5. **矛盾点列表**:列出所有发现的矛盾或可疑之处
|
|
90
|
+
|
|
91
|
+
---
|
|
92
|
+
|
|
93
|
+
## 最终报告输出
|
|
94
|
+
|
|
95
|
+
请将所有分析整理为 Markdown 格式的鉴定报告,包含:
|
|
96
|
+
- 基本信息表格
|
|
97
|
+
- 知识边界测试结果
|
|
98
|
+
- 能力验证结果
|
|
99
|
+
- 专家分析(自洽性、知识截止推断、身份推断、矛盾点)
|
|
100
|
+
- 最终结论(声称身份、推断身份、可信度评分)
|
|
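The Phase 4 "knowledge boundary verification" step (知识边界验证) infers a cutoff date from which dated questions the model could answer. One way to picture that inference is the sketch below. It assumes the reviewer supplies each probe's public date as a `YYYY-MM` string together with a `known` flag. The function name and the probe shape are hypothetical and do not come from ccfingerprint.

```javascript
// Hypothetical sketch for illustration only; not part of ccfingerprint.
// Each probe is { date: "YYYY-MM", known: boolean }, where `known` means the
// model answered correctly from built-in knowledge.
function inferCutoffWindow(probes) {
  // "YYYY-MM" strings sort chronologically under plain string comparison.
  const sorted = [...probes].sort((a, b) => a.date.localeCompare(b.date));
  let latestKnown = null;
  let earliestUnknown = null;
  for (const probe of sorted) {
    if (probe.known) {
      latestKnown = probe.date; // ascending order, so the last hit wins
    } else if (earliestUnknown === null) {
      earliestUnknown = probe.date;
    }
  }
  // Knowing a later event while missing an earlier one is suspicious.
  const inconsistent =
    latestKnown !== null && earliestUnknown !== null && earliestUnknown < latestKnown;
  return { latestKnown, earliestUnknown, inconsistent };
}

// Nobel Prizes are announced in October, so the two Nobel sections of the
// template map to probes dated 2024-10 and 2025-10.
const example = inferCutoffWindow([
  { date: "2024-10", known: true },
  { date: "2025-10", known: false },
]);
console.log(example);
```

The cutoff is then bracketed between `latestKnown` and `earliestUnknown`. When `inconsistent` is true, the answers contradict each other, which belongs in the Phase 4 contradiction list rather than in a cutoff estimate.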
@@ -45,8 +45,12 @@ Please answer the following questions (if you don't know, clearly state "I don't
|
|
|
45
45
|
4. Who won the 2025 Nobel Prize in Chemistry? What was their contribution?
|
|
46
46
|
5. Who won the 2025 Nobel Prize in Physiology or Medicine?
|
|
47
47
|
|
|
48
|
+
### 2025-2026 AI Milestones
|
|
49
|
+
6. When was OpenAI's GPT-5 released? What are its main features?
|
|
50
|
+
7. When was Anthropic's Claude Opus 4.6 released? What are its main improvements?
|
|
51
|
+
|
|
48
52
|
### Style Signature
|
|
49
|
-
|
|
53
|
+
8. Please create an ASCII art signature or pattern that represents your identity
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@ Based on the parameters you claimed in Phase 1, answer:
|
|
|
62
66
|
|
|
63
67
|
4. **Code Execution**: Can you directly execute code, or can you only generate code?
|
|
64
68
|
|
|
69
|
+
5. **Reasoning Mode**: Do you support extended thinking or reasoning mode? If so, please describe how it works.
|
|
70
|
+
|
|
71
|
+
6. **Tool Calling**: Do you support tool use / function calling? Can you call multiple tools in parallel?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## Phase 4: Third-Party Verification
|
|
@@ -72,7 +80,7 @@ Assume this is an anonymous test report from an unknown model, please objectivel
|
|
|
72
80
|
|
|
73
81
|
1. **Consistency Analysis**: Is this information internally consistent? Are there any contradictions?
|
|
74
82
|
|
|
75
|
-
2. **Knowledge Boundary Verification**: Based on the Nobel Prize answers, infer the true knowledge cutoff date
|
|
83
|
+
2. **Knowledge Boundary Verification**: Based on the Nobel Prize and AI milestone answers, infer the true knowledge cutoff date
|
|
76
84
|
|
|
77
85
|
3. **Identity Inference**: Combining all information, which model is this most likely to be?
|
|
78
86
|
|
|
@@ -45,8 +45,12 @@
|
|
|
45
45
|
4. 2025年诺贝尔化学奖授予了谁?他们的贡献是什么?
|
|
46
46
|
5. 2025年诺贝尔生理学或医学奖授予了谁?
|
|
47
47
|
|
|
48
|
+
### 2025-2026年AI大事件
|
|
49
|
+
6. OpenAI 的 GPT-5 是什么时候发布的?它有哪些主要特点?
|
|
50
|
+
7. Anthropic 的 Claude Opus 4.6 是什么时候发布的?它有哪些主要改进?
|
|
51
|
+
|
|
48
52
|
### 风格签名
|
|
49
|
-
|
|
53
|
+
8. 请用 ASCII 艺术创作一个代表你身份的签名或图案
|
|
50
54
|
|
|
51
55
|
---
|
|
52
56
|
|
|
@@ -62,6 +66,10 @@
|
|
|
62
66
|
|
|
63
67
|
4. **代码执行**:你能直接执行代码吗?还是只能生成代码?
|
|
64
68
|
|
|
69
|
+
5. **推理模式**:你是否支持扩展思考(extended thinking)或推理模式?如果支持,请描述其工作方式。
|
|
70
|
+
|
|
71
|
+
6. **工具调用**:你支持工具调用(tool use / function calling)吗?能否并行调用多个工具?
|
|
72
|
+
|
|
65
73
|
---
|
|
66
74
|
|
|
67
75
|
## 阶段 4: 第三方验证
|
|
@@ -72,7 +80,7 @@
|
|
|
72
80
|
|
|
73
81
|
1. **自洽性分析**:这些信息是否内部一致?有无矛盾之处?
|
|
74
82
|
|
|
75
|
-
2.
|
|
83
|
+
2. **知识边界验证**:根据诺贝尔奖问题和AI大事件问题的回答,推断其真实的知识截止日期
|
|
76
84
|
|
|
77
85
|
3. **身份推断**:综合所有信息,这最可能是哪个模型?
|
|
78
86
|
|