@bgicli/bgicli 2.2.8 → 2.2.10
- package/data/skills/anthropic-algorithmic-art/SKILL.md +405 -0
- package/data/skills/anthropic-canvas-design/SKILL.md +130 -0
- package/data/skills/anthropic-claude-api/SKILL.md +243 -0
- package/data/skills/anthropic-doc-coauthoring/SKILL.md +375 -0
- package/data/skills/anthropic-docx/SKILL.md +590 -0
- package/data/skills/anthropic-frontend-design/SKILL.md +42 -0
- package/data/skills/anthropic-internal-comms/SKILL.md +32 -0
- package/data/skills/anthropic-mcp-builder/SKILL.md +236 -0
- package/data/skills/anthropic-pdf/SKILL.md +314 -0
- package/data/skills/anthropic-pptx/SKILL.md +232 -0
- package/data/skills/anthropic-skill-creator/SKILL.md +485 -0
- package/data/skills/anthropic-webapp-testing/SKILL.md +96 -0
- package/data/skills/anthropic-xlsx/SKILL.md +292 -0
- package/data/skills/arxiv-database/SKILL.md +362 -0
- package/data/skills/astropy/SKILL.md +329 -0
- package/data/skills/ctx-advanced-evaluation/SKILL.md +402 -0
- package/data/skills/ctx-bdi-mental-states/SKILL.md +311 -0
- package/data/skills/ctx-context-compression/SKILL.md +272 -0
- package/data/skills/ctx-context-degradation/SKILL.md +206 -0
- package/data/skills/ctx-context-fundamentals/SKILL.md +201 -0
- package/data/skills/ctx-context-optimization/SKILL.md +195 -0
- package/data/skills/ctx-evaluation/SKILL.md +251 -0
- package/data/skills/ctx-filesystem-context/SKILL.md +287 -0
- package/data/skills/ctx-hosted-agents/SKILL.md +260 -0
- package/data/skills/ctx-memory-systems/SKILL.md +225 -0
- package/data/skills/ctx-multi-agent-patterns/SKILL.md +257 -0
- package/data/skills/ctx-project-development/SKILL.md +291 -0
- package/data/skills/ctx-tool-design/SKILL.md +271 -0
- package/data/skills/dhdna-profiler/SKILL.md +162 -0
- package/data/skills/generate-image/SKILL.md +183 -0
- package/data/skills/geomaster/SKILL.md +365 -0
- package/data/skills/get-available-resources/SKILL.md +275 -0
- package/data/skills/hamelsmu-build-review-interface/SKILL.md +96 -0
- package/data/skills/hamelsmu-error-analysis/SKILL.md +164 -0
- package/data/skills/hamelsmu-eval-audit/SKILL.md +183 -0
- package/data/skills/hamelsmu-evaluate-rag/SKILL.md +177 -0
- package/data/skills/hamelsmu-generate-synthetic-data/SKILL.md +131 -0
- package/data/skills/hamelsmu-validate-evaluator/SKILL.md +212 -0
- package/data/skills/hamelsmu-write-judge-prompt/SKILL.md +144 -0
- package/data/skills/hf-cli/SKILL.md +174 -0
- package/data/skills/hf-mcp/SKILL.md +178 -0
- package/data/skills/hugging-face-dataset-viewer/SKILL.md +121 -0
- package/data/skills/hugging-face-datasets/SKILL.md +542 -0
- package/data/skills/hugging-face-evaluation/SKILL.md +651 -0
- package/data/skills/hugging-face-jobs/SKILL.md +1042 -0
- package/data/skills/hugging-face-model-trainer/SKILL.md +717 -0
- package/data/skills/hugging-face-paper-pages/SKILL.md +239 -0
- package/data/skills/hugging-face-paper-publisher/SKILL.md +624 -0
- package/data/skills/hugging-face-tool-builder/SKILL.md +110 -0
- package/data/skills/hugging-face-trackio/SKILL.md +115 -0
- package/data/skills/hugging-face-vision-trainer/SKILL.md +593 -0
- package/data/skills/huggingface-gradio/SKILL.md +245 -0
- package/data/skills/matlab/SKILL.md +376 -0
- package/data/skills/modal/SKILL.md +381 -0
- package/data/skills/openai-cloudflare-deploy/SKILL.md +224 -0
- package/data/skills/openai-develop-web-game/SKILL.md +149 -0
- package/data/skills/openai-doc/SKILL.md +80 -0
- package/data/skills/openai-figma/SKILL.md +42 -0
- package/data/skills/openai-figma-implement-design/SKILL.md +264 -0
- package/data/skills/openai-gh-address-comments/SKILL.md +25 -0
- package/data/skills/openai-gh-fix-ci/SKILL.md +69 -0
- package/data/skills/openai-imagegen/SKILL.md +174 -0
- package/data/skills/openai-jupyter-notebook/SKILL.md +107 -0
- package/data/skills/openai-linear/SKILL.md +87 -0
- package/data/skills/openai-netlify-deploy/SKILL.md +247 -0
- package/data/skills/openai-notion-knowledge-capture/SKILL.md +56 -0
- package/data/skills/openai-notion-meeting-intelligence/SKILL.md +60 -0
- package/data/skills/openai-notion-research-documentation/SKILL.md +59 -0
- package/data/skills/openai-notion-spec-to-implementation/SKILL.md +58 -0
- package/data/skills/openai-openai-docs/SKILL.md +69 -0
- package/data/skills/openai-pdf/SKILL.md +67 -0
- package/data/skills/openai-playwright/SKILL.md +147 -0
- package/data/skills/openai-render-deploy/SKILL.md +479 -0
- package/data/skills/openai-screenshot/SKILL.md +267 -0
- package/data/skills/openai-security-best-practices/SKILL.md +86 -0
- package/data/skills/openai-security-ownership-map/SKILL.md +206 -0
- package/data/skills/openai-security-threat-model/SKILL.md +81 -0
- package/data/skills/openai-sentry/SKILL.md +123 -0
- package/data/skills/openai-sora/SKILL.md +178 -0
- package/data/skills/openai-speech/SKILL.md +144 -0
- package/data/skills/openai-spreadsheet/SKILL.md +145 -0
- package/data/skills/openai-transcribe/SKILL.md +81 -0
- package/data/skills/openai-vercel-deploy/SKILL.md +77 -0
- package/data/skills/openai-yeet/SKILL.md +28 -0
- package/data/skills/pennylane/SKILL.md +224 -0
- package/data/skills/polars-bio/SKILL.md +374 -0
- package/data/skills/primekg/SKILL.md +97 -0
- package/data/skills/pymatgen/SKILL.md +689 -0
- package/data/skills/qiskit/SKILL.md +273 -0
- package/data/skills/qutip/SKILL.md +316 -0
- package/data/skills/recursive-decomposition/SKILL.md +185 -0
- package/data/skills/rowan/SKILL.md +427 -0
- package/data/skills/scholar-evaluation/SKILL.md +298 -0
- package/data/skills/sentry-create-alert/SKILL.md +210 -0
- package/data/skills/sentry-fix-issues/SKILL.md +126 -0
- package/data/skills/sentry-pr-code-review/SKILL.md +105 -0
- package/data/skills/sentry-python-sdk/SKILL.md +317 -0
- package/data/skills/sentry-setup-ai-monitoring/SKILL.md +217 -0
- package/data/skills/stable-baselines3/SKILL.md +297 -0
- package/data/skills/sympy/SKILL.md +498 -0
- package/data/skills/trailofbits-ask-questions-if-underspecified/SKILL.md +85 -0
- package/data/skills/trailofbits-audit-context-building/SKILL.md +302 -0
- package/data/skills/trailofbits-differential-review/SKILL.md +220 -0
- package/data/skills/trailofbits-insecure-defaults/SKILL.md +117 -0
- package/data/skills/trailofbits-modern-python/SKILL.md +333 -0
- package/data/skills/trailofbits-property-based-testing/SKILL.md +123 -0
- package/data/skills/trailofbits-semgrep-rule-creator/SKILL.md +172 -0
- package/data/skills/trailofbits-sharp-edges/SKILL.md +292 -0
- package/data/skills/trailofbits-variant-analysis/SKILL.md +142 -0
- package/data/skills/transformers.js/SKILL.md +637 -0
- package/data/skills/writing/SKILL.md +419 -0
- package/dist/bgi.js +66 -2
- package/package.json +1 -1
@@ -0,0 +1,178 @@
---
name: hf-mcp
description: Use Hugging Face Hub via MCP server tools. Search models, datasets, Spaces, papers. Get repo details, fetch documentation, run compute jobs, and use Gradio Spaces as AI tools. Available when connected to the HF MCP server.
---

# Hugging Face MCP Server

Connect AI assistants to the Hugging Face Hub. Setup: https://huggingface.co/settings/mcp

## Use Cases & Examples

### Find the Best Model for a Task

```
User: "Find the best model for code generation"

1. model_search(task="text-generation", query="code", sort="trendingScore", limit=10)
2. hub_repo_details(repo_ids=["top-result-id"], include_readme=true)
```

### Compare Models from Different Providers

```
User: "Compare Llama vs Qwen for text generation"

1. model_search(author="meta-llama", task="text-generation", sort="downloads", limit=5)
2. model_search(author="Qwen", task="text-generation", sort="downloads", limit=5)
3. hub_repo_details(repo_ids=["meta-llama/Llama-3.2-1B", "Qwen/Qwen3-8B"], include_readme=true)
```

### Find Training Datasets

```
User: "Find datasets for sentiment analysis in English"

1. dataset_search(query="sentiment", tags=["language:en", "task_categories:text-classification"], sort="downloads")
2. hub_repo_details(repo_ids=["top-dataset-id"], repo_type="dataset", include_readme=true)
```

### Discover AI Tools (MCP Spaces)

```
User: "Find a tool that can remove image backgrounds"

1. space_search(query="background removal", mcp=true)
2. dynamic_space(operation="view_parameters", space_name="result-space-id")
3. dynamic_space(operation="invoke", space_name="result-space-id", parameters="{...}")
```

### Generate Images

```
User: "Create an image of a robot reading a book"

1. dynamic_space(operation="discover")  # See available tasks
2. gr1_flux1_schnell_infer(prompt="a robot sitting in a library reading a book, warm lighting, detailed")
```

### Research a Topic

```
User: "What are the latest papers on RLHF?"

1. paper_search(query="reinforcement learning from human feedback", results_limit=10)
2. hub_repo_details(repo_ids=["paper-linked-model"], include_readme=true)  # If paper links to models
```

### Learn How to Use a Library

```
User: "How do I fine-tune with LoRA using PEFT?"

1. hf_doc_search(query="LoRA fine-tuning", product="peft")
2. hf_doc_fetch(doc_url="https://huggingface.co/docs/peft/...")
```

### Run a Quick GPU Job

```
User: "Run this Python script on a GPU"

hf_jobs(operation="uv", args={
    "script": "# /// script\n# dependencies = [\"torch\"]\n# ///\nimport torch\nprint(torch.cuda.is_available())",
    "flavor": "t4-small"
})
```
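
The escaped `script` string in the call above is easier to read when written out as a normal multi-line string. The sketch below is illustrative only: it builds the same `args` payload in Python, and the comment header it contains is PEP 723 inline script metadata. `hf_jobs` is an MCP tool rather than a Python function, so it is not called here; the variable names `script`, `args`, and `payload` are assumptions made for this example.

```python
import json

# The job script with its PEP 723 inline metadata block, unescaped.
script = """\
# /// script
# dependencies = ["torch"]
# ///
import torch
print(torch.cuda.is_available())"""

# Same arguments as the hf_jobs(operation="uv", ...) example above.
args = {"script": script, "flavor": "t4-small"}

# Serializing to JSON reproduces the escaped one-line form (\n and \").
payload = json.dumps(args)
print(payload)
```

Keeping the script in a multi-line string and letting `json.dumps` do the escaping avoids hand-written `\n` and `\"` sequences.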

### Train a Model on Cloud GPU

```
User: "Run my training script on an A10G"

hf_jobs(operation="run", args={
    "image": "pytorch/pytorch:2.5.1-cuda12.4-cudnn9-runtime",
    "command": ["/bin/sh", "-lc", "pip install transformers trl && python train.py"],
    "flavor": "a10g-small",
    "secrets": {"HF_TOKEN": "$HF_TOKEN"}
})
```

### Check Job Status

```
User: "What's happening with my training job?"

1. hf_jobs(operation="ps")
2. hf_jobs(operation="logs", args={"job_id": "job-xxxxx"})
```

### Explore What's Trending

```
User: "What models are trending right now?"

model_search(sort="trendingScore", limit=20)
```

### Get Model Card Details

```
User: "Tell me about Mistral-7B"

hub_repo_details(repo_ids=["mistralai/Mistral-7B-v0.1"], include_readme=true)
```

### Find Quantized Models

```
User: "Find GGUF versions of Llama 3"

model_search(query="Llama 3 GGUF", sort="downloads", limit=10)
```

### Use a Gradio Space as a Tool

```
User: "Transcribe this audio file"

1. space_search(query="speech to text transcription", mcp=true)
2. dynamic_space(operation="view_parameters", space_name="openai/whisper")
3. dynamic_space(operation="invoke", space_name="openai/whisper", parameters="{\"audio\": \"...\"}")
```

### Schedule Recurring Jobs

```
User: "Run this data sync every day at midnight"

hf_jobs(operation="scheduled uv", args={
    "script": "...",
    "cron": "0 0 * * *",
    "flavor": "cpu-basic"
})
```

## Tool Selection Guide

| Goal | Tool |
|------|------|
| Find models | `model_search` |
| Find datasets | `dataset_search` |
| Find Spaces/apps | `space_search` |
| Find papers | `paper_search` |
| Get repo README/details | `hub_repo_details` |
| Learn library usage | `hf_doc_search` → `hf_doc_fetch` |
| Run code on GPU/CPU | `hf_jobs` |
| Use Gradio apps as tools | `dynamic_space` |
| Generate images | `gr1_flux1_schnell_infer` or `dynamic_space` |
| Check auth | `hf_whoami` |

## Tips

- Use `sort="trendingScore"` to find what's popular now
- Use `sort="downloads"` to find battle-tested options
- Set `mcp=true` in `space_search` to find Spaces usable as tools
- Use `include_readme=true` in `hub_repo_details` for full model/dataset documentation
- For jobs accessing private repos, always include `secrets: {"HF_TOKEN": "$HF_TOKEN"}`
- Use `dynamic_space(operation="discover")` to see all available Space-based tasks
@@ -0,0 +1,121 @@
---
name: hugging-face-dataset-viewer
description: Use this skill for Hugging Face Dataset Viewer API workflows that fetch subset/split metadata, paginate rows, search text, apply filters, download parquet URLs, and read size or statistics.
---

# Hugging Face Dataset Viewer

Use this skill to execute read-only Dataset Viewer API calls for dataset exploration and extraction.

## Core workflow

1. Optionally validate dataset availability with `/is-valid`.
2. Resolve `config` + `split` with `/splits`.
3. Preview with `/first-rows`.
4. Paginate content with `/rows` using `offset` and `length` (max 100).
5. Use `/search` for text matching and `/filter` for row predicates.
6. Retrieve parquet links via `/parquet` and totals/metadata via `/size` and `/statistics`.

## Defaults

- Base URL: `https://datasets-server.huggingface.co`
- Default API method: `GET`
- Query params should be URL-encoded.
- `offset` is 0-based.
- `length` max is usually `100` for row-like endpoints.
- Gated/private datasets require `Authorization: Bearer <HF_TOKEN>`.
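
The defaults above can be sketched as a small request builder. This is an illustrative sketch rather than part of the skill: the helper name `build_request` is hypothetical, and the code only constructs the request object without sending it.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://datasets-server.huggingface.co"


def build_request(endpoint, params, token=None):
    """Build a GET request for a Dataset Viewer endpoint (hypothetical helper)."""
    # urlencode percent-encodes every value, e.g. "/" in "namespace/repo" becomes %2F.
    url = f"{BASE_URL}{endpoint}?{urlencode(params)}"
    headers = {}
    if token:
        # Gated/private datasets need a bearer token.
        headers["Authorization"] = f"Bearer {token}"
    return Request(url, headers=headers, method="GET")


req = build_request(
    "/rows",
    {"dataset": "stanfordnlp/imdb", "config": "plain_text", "split": "train",
     "offset": 0, "length": 100},
)
print(req.full_url)
```

To actually send the request, the returned object can be passed to `urllib.request.urlopen`.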

## Dataset Viewer

- `Validate dataset`: `/is-valid?dataset=<namespace/repo>`
- `List subsets and splits`: `/splits?dataset=<namespace/repo>`
- `Preview first rows`: `/first-rows?dataset=<namespace/repo>&config=<config>&split=<split>`
- `Paginate rows`: `/rows?dataset=<namespace/repo>&config=<config>&split=<split>&offset=<int>&length=<int>`
- `Search text`: `/search?dataset=<namespace/repo>&config=<config>&split=<split>&query=<text>&offset=<int>&length=<int>`
- `Filter with predicates`: `/filter?dataset=<namespace/repo>&config=<config>&split=<split>&where=<predicate>&orderby=<sort>&offset=<int>&length=<int>`
- `List parquet shards`: `/parquet?dataset=<namespace/repo>`
- `Get size totals`: `/size?dataset=<namespace/repo>`
- `Get column statistics`: `/statistics?dataset=<namespace/repo>&config=<config>&split=<split>`
- `Get Croissant metadata (if available)`: `/croissant?dataset=<namespace/repo>`

Pagination pattern:

```bash
curl "https://datasets-server.huggingface.co/rows?dataset=stanfordnlp/imdb&config=plain_text&split=train&offset=0&length=100"
curl "https://datasets-server.huggingface.co/rows?dataset=stanfordnlp/imdb&config=plain_text&split=train&offset=100&length=100"
```

When pagination is partial, use response fields such as `num_rows_total`, `num_rows_per_page`, and `partial` to drive continuation logic.
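
The continuation logic can be sketched as a generator of `offset`/`length` pairs. This is a minimal sketch under stated assumptions: `page_offsets` is a hypothetical helper, `num_rows_total` is assumed to come from an earlier `/rows` response, and the cap of 100 follows the limit stated in the core workflow.

```python
def page_offsets(num_rows_total, length=100):
    """Yield (offset, length) pairs covering num_rows_total rows, 0-based."""
    if not 1 <= length <= 100:
        raise ValueError("length must be between 1 and 100")
    for offset in range(0, num_rows_total, length):
        # The last page may be shorter than the requested length.
        yield offset, min(length, num_rows_total - offset)


pages = list(page_offsets(250))
print(pages)  # [(0, 100), (100, 100), (200, 50)]
```

Each pair maps directly onto the `offset` and `length` query parameters of `/rows`.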

Search/filter notes:

- `/search` matches string columns (full-text style behavior is internal to the API).
- `/filter` requires predicate syntax in `where` and optional sort in `orderby`.
- Keep filtering and searches read-only and side-effect free.
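
Because `where` predicates contain quotes and operators, they need URL-encoding before they go into a `/filter` URL. The sketch below shows one way to build such a URL. The helper `filter_url` is hypothetical, and the sample predicate `"label"=0` (column name in double quotes) is an assumption about the predicate syntax that should be checked against the Dataset Viewer documentation.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

BASE_URL = "https://datasets-server.huggingface.co"


def filter_url(dataset, config, split, where, orderby=None, offset=0, length=100):
    """Build a URL-encoded /filter URL (hypothetical helper)."""
    params = {"dataset": dataset, "config": config, "split": split,
              "where": where, "offset": offset, "length": length}
    if orderby is not None:
        # orderby is optional, so it is only added when given.
        params["orderby"] = orderby
    return f"{BASE_URL}/filter?{urlencode(params)}"


# Assumed predicate syntax, for illustration only.
url = filter_url("stanfordnlp/imdb", "plain_text", "train", where='"label"=0', length=10)
print(url)

# Decoding the query string recovers the original predicate unchanged.
print(parse_qs(urlsplit(url).query)["where"][0])
```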

## Querying Datasets

Use `npx parquetlens` with Hub parquet alias paths for SQL querying.

Parquet alias shape:

```text
hf://datasets/<namespace>/<repo>@~parquet/<config>/<split>/<shard>.parquet
```

Derive `<config>`, `<split>`, and `<shard>` from Dataset Viewer `/parquet`:

```bash
curl -s "https://datasets-server.huggingface.co/parquet?dataset=cfahlgren1/hub-stats" \
  | jq -r '.parquet_files[] | "hf://datasets/\(.dataset)@~parquet/\(.config)/\(.split)/\(.filename)"'
```
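
The same mapping as the `jq` filter can be sketched in Python for callers that already hold the parsed `/parquet` response. The helper name `parquet_aliases` is hypothetical, and the sample payload is made up for illustration; it contains only the fields the `jq` filter reads.

```python
def parquet_aliases(parquet_response):
    """Map a parsed /parquet response to hf:// alias paths (hypothetical helper)."""
    return [
        f"hf://datasets/{f['dataset']}@~parquet/{f['config']}/{f['split']}/{f['filename']}"
        for f in parquet_response["parquet_files"]
    ]


# Made-up sample payload; real values come from the API.
sample = {
    "parquet_files": [
        {"dataset": "example-user/example-dataset", "config": "default",
         "split": "train", "filename": "0000.parquet"},
    ]
}
aliases = parquet_aliases(sample)
print(aliases[0])
```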

Run SQL query:

```bash
npx -y -p parquetlens -p @parquetlens/sql parquetlens \
  "hf://datasets/<namespace>/<repo>@~parquet/<config>/<split>/<shard>.parquet" \
  --sql "SELECT * FROM data LIMIT 20"
```

### SQL export

- CSV: `--sql "COPY (SELECT * FROM data LIMIT 1000) TO 'export.csv' (FORMAT CSV, HEADER, DELIMITER ',')"`
- JSON: `--sql "COPY (SELECT * FROM data LIMIT 1000) TO 'export.json' (FORMAT JSON)"`
- Parquet: `--sql "COPY (SELECT * FROM data LIMIT 1000) TO 'export.parquet' (FORMAT PARQUET)"`

## Creating and Uploading Datasets

Use one of these flows depending on dependency constraints.

Zero local dependencies (Hub UI):

- Create dataset repo in browser: `https://huggingface.co/new-dataset`
- Upload parquet files in the repo "Files and versions" page.
- Verify shards appear in Dataset Viewer:

```bash
curl -s "https://datasets-server.huggingface.co/parquet?dataset=<namespace>/<repo>"
```

Low dependency CLI flow (`npx @huggingface/hub` / `hfjs`):

- Set auth token:

```bash
export HF_TOKEN=<your_hf_token>
```

- Upload parquet folder to a dataset repo (auto-creates repo if missing):

```bash
npx -y @huggingface/hub upload datasets/<namespace>/<repo> ./local/parquet-folder data
```

- Upload as private repo on creation:

```bash
npx -y @huggingface/hub upload datasets/<namespace>/<repo> ./local/parquet-folder data --private
```

After upload, call `/parquet` to discover `<config>/<split>/<shard>` values for querying with `@~parquet`.