Home/Data Analysis/parallel-data-enrichment
P

parallel-data-enrichment

by @parallel-webv
4.5(20)

Batch enriches existing data from web sources (e.g., CEO names, company information), enhancing dataset completeness and value to support deeper analysis.

data-enrichmentparallel-processingdata-integrationetldata-pipelinesGitHub
Installation
npx skills add parallel-web/parallel-agent-skills --skill parallel-data-enrichment
compare_arrows

Before / After Comparison

1
Before

Raw data information is incomplete, requiring manual lookup and supplementation, which is time-consuming, labor-intensive, and prone to errors, affecting the depth and breadth of data analysis.

After

Automatically batch-acquires and supplements data fields from the web, such as CEO names, financing information, etc., greatly enriching data dimensions and enhancing data analysis value.

SKILL.md

Data Enrichment

Enrich: $ARGUMENTS

Before starting

Inform the user that enrichment may take several minutes depending on the number of rows and fields requested.

Step 1: Start the enrichment

Use ONE of these command patterns (substitute user's actual data):

For inline data:

parallel-cli enrich run --data '[{"company": "Google"}, {"company": "Microsoft"}]' --intent "CEO name and founding year" --target "output.csv" --no-wait

For CSV file:

parallel-cli enrich run --source-type csv --source "input.csv" --target "output.csv" --source-columns '[{"name": "company", "description": "Company name"}]' --intent "CEO name and founding year" --no-wait

IMPORTANT: Always include --no-wait so the command returns immediately instead of blocking.

Parse the output to extract the taskgroup_id and monitoring URL. Immediately tell the user:

  • Enrichment has been kicked off
  • The monitoring URL where they can track progress

Tell them they can background the polling step to continue working while it runs.

Step 2: Poll for results

parallel-cli enrich poll "$TASKGROUP_ID" --timeout 540

Important:

  • Use --timeout 540 (9 minutes) to stay within tool execution limits

If the poll times out

Enrichment of large datasets can take longer than 9 minutes. If the poll exits without completing:

  1. Tell the user the enrichment is still running server-side
  2. Re-run the same parallel-cli enrich poll command to continue waiting

Response format

After step 1: Share the monitoring URL (for tracking progress).

After step 2:

  1. Report number of rows enriched
  2. Preview first few rows of the output CSV
  3. Tell user the full path to the output CSV file

Do NOT re-share the monitoring URL after completion — the results are in the output file.

Setup

If parallel-cli is not found, install and authenticate:

curl -fsSL https://parallel.ai/install.sh | bash

If unable to install that way, install via pipx instead:

pipx install "parallel-web-tools[cli]"
pipx ensurepath

Then authenticate:

parallel-cli login

Or set an API key: export PARALLEL_API_KEY="your-key"

User Reviews (0)

Write a Review

Effect
Usability
Docs
Compatibility

No reviews yet

Statistics

Installs7.7K
Rating4.5 / 5.0
Version
Updated2026年5月21日
Comparisons1

User Rating

4.5(20)
5
15%
4
45%
3
35%
2
5%
1
0%

Rate this Skill

0.0

Compatible Platforms

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

Timeline

Created2026年3月16日
Last Updated2026年5月21日