apify/apify-haystack
The official integration for Apify and Haystack 2.0
Provides Haystack document components that execute Apify Actors for web scraping and data extraction, automatically converting scraped datasets into Haystack's Document format. Enables RAG pipelines and LLM applications to ingest fresh web data directly from serverless Apify tasks like website crawling, social media scraping, and structured data extraction. Supports custom mapping functions to transform raw Actor outputs into typed Documents compatible with Haystack's retrieval and indexing workflows.
Available on PyPI.
Stars
3
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 19, 2026
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/apify/apify-haystack"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/UltraRAG
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
AnkitNayak-eth/EpsteinFiles-RAG
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
Quansight/ragna
RAG orchestration framework ⛵️
microsoft/rag-time
RAG Time: A 5-week Learning Journey to Mastering RAG
microsoft/rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the...