Barry robbins
Barryrobbins64
AI & ML interests
None yet
Recent Activity
replied to
singhsidhukuldeep's
post
about 2 months ago
Are you tired of writing scripts to scrape data from the web? π
ScrapeGraphAI is here for you! π
ScrapeGraphAI is an OPEN-SOURCE web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). ππ
Just say which information you want to extract (in human language) and the library will do it for you! π£οΈπ
It supports GPT, Gemini, and open-source models like Mistral. π
A few things that I could not find in the docs but would be amazing to see π€:
- Captcha handling π
- Persistent data output formatting π
- Streaming output π‘
- Explanationπ of the tag line: "ScrapeGraphAI: You Only Scrape Once" What does that even mean? π€£ Is this YOLO? π€
Link: https://github.com/VinciGit00/Scrapegraph-ai
Demo code: https://github.com/amrrs/scrapegraph-code/blob/main/sourcegraph.ipynb
Organizations
None yet
models
None public yet
datasets
None public yet