首页/数据分析/parallel-data-enrichment
P

parallel-data-enrichment

by @parallel-webv
4.5(20)

批量从网络源(如CEO姓名、公司信息)丰富现有数据,提升数据集的完整性和价值,支持更深入的分析。

data-enrichmentparallel-processingdata-integrationetldata-pipelinesGitHub
安装方式
npx skills add parallel-web/parallel-agent-skills --skill parallel-data-enrichment
compare_arrows

Before / After 效果对比

1
使用前

原始数据信息不完整,需要手动查找补充,耗时耗力且容易出错,影响数据分析的深度和广度。

使用后

自动从网络批量获取并补充数据字段,如CEO姓名、融资信息等,极大丰富数据维度,提升数据分析价值。

SKILL.md

Data Enrichment

Enrich: $ARGUMENTS

Before starting

Inform the user that enrichment may take several minutes depending on the number of rows and fields requested.

Step 1: Start the enrichment

Use ONE of these command patterns (substitute user's actual data):

For inline data:

parallel-cli enrich run --data '[{"company": "Google"}, {"company": "Microsoft"}]' --intent "CEO name and founding year" --target "output.csv" --no-wait

For CSV file:

parallel-cli enrich run --source-type csv --source "input.csv" --target "output.csv" --source-columns '[{"name": "company", "description": "Company name"}]' --intent "CEO name and founding year" --no-wait

IMPORTANT: Always include --no-wait so the command returns immediately instead of blocking.

Parse the output to extract the taskgroup_id and monitoring URL. Immediately tell the user:

  • Enrichment has been kicked off
  • The monitoring URL where they can track progress

Tell them they can background the polling step to continue working while it runs.

Step 2: Poll for results

parallel-cli enrich poll "$TASKGROUP_ID" --timeout 540

Important:

  • Use --timeout 540 (9 minutes) to stay within tool execution limits

If the poll times out

Enrichment of large datasets can take longer than 9 minutes. If the poll exits without completing:

  1. Tell the user the enrichment is still running server-side
  2. Re-run the same parallel-cli enrich poll command to continue waiting

Response format

After step 1: Share the monitoring URL (for tracking progress).

After step 2:

  1. Report number of rows enriched
  2. Preview first few rows of the output CSV
  3. Tell user the full path to the output CSV file

Do NOT re-share the monitoring URL after completion — the results are in the output file.

Setup

If parallel-cli is not found, install and authenticate:

curl -fsSL https://parallel.ai/install.sh | bash

If unable to install that way, install via pipx instead:

pipx install "parallel-web-tools[cli]"
pipx ensurepath

Then authenticate:

parallel-cli login

Or set an API key: export PARALLEL_API_KEY="your-key"

用户评价 (0)

发表评价

效果
易用性
文档
兼容性

暂无评价

统计数据

安装量7.7K
评分4.5 / 5.0
版本
更新日期2026年5月21日
对比案例1 组

用户评分

4.5(20)
5
15%
4
45%
3
35%
2
5%
1
0%

为此 Skill 评分

0.0

兼容平台

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

时间线

创建2026年3月16日
最后更新2026年5月21日