doc-to-vector-dataset-generator

Converts documents into clean, chunked datasets suitable for embeddings and vector search. Produces chunked JSONL files with metadata, deduplication logic, and quality checks. Use when preparing "trai
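As a rough illustration of the pipeline described above (chunking, metadata, deduplication, a quality check, JSONL output), here is a minimal Python sketch. It is not the skill's actual implementation: the function names (`chunk_text`, `build_records`, `to_jsonl`), the record fields, the character-window chunking, the hash-based exact-duplicate check, and the minimum-length filter are all assumptions chosen for illustration.

```python
import hashlib
import json


def chunk_text(text, max_chars=200, overlap=20):
    """Split text into overlapping fixed-size character windows (assumed strategy)."""
    if overlap >= max_chars:
        raise ValueError("overlap must be smaller than max_chars")
    step = max_chars - overlap
    chunks = []
    for start in range(0, len(text), step):
        piece = text[start:start + max_chars].strip()
        if piece:
            chunks.append(piece)
        # Stop once the current window has reached the end of the text.
        if start + max_chars >= len(text):
            break
    return chunks


def build_records(docs, max_chars=200, overlap=20, min_chars=10):
    """Return one metadata record per unique chunk that passes a length check.

    `docs` maps a hypothetical document id to its raw text.
    """
    seen = set()
    records = []
    for doc_id, text in docs.items():
        for index, chunk in enumerate(chunk_text(text, max_chars, overlap)):
            # Quality check (assumed): drop very short fragments.
            if len(chunk) < min_chars:
                continue
            # Deduplicate on a hash of case- and whitespace-normalized text.
            normalized = " ".join(chunk.lower().split())
            digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
            if digest in seen:
                continue
            seen.add(digest)
            records.append({
                "id": f"{doc_id}-{index}",
                "source": doc_id,
                "chunk_index": index,
                "text": chunk,
                "sha256": digest,
            })
    return records


def to_jsonl(records):
    """Serialize records as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


if __name__ == "__main__":
    sample = {
        "a": "Hello world, this is a test.",
        "b": "hello   WORLD, this is a test.",  # duplicate after normalization
    }
    print(to_jsonl(build_records(sample)))
```

Hashing normalized text only catches exact duplicates up to case and whitespace; near-duplicate detection (for example MinHash) and token-based chunking would be natural substitutes depending on the embedding model in use.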

by patricio0312rev · Repository · data
Also installable via the skills CLI:
npx skills add patricio0312rev/skillset/templates/ai-engineering/doc-to-vector-dataset-generator

Source

Path: templates/ai-engineering/doc-to-vector-dataset-generator (main)
