Stars
CrawleeβA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, anβ¦
estela, an elastic web scraping cluster πΈ
Slack integration for Django, using the templating engine to generate messages
Declarative model lifecycle hooks, an alternative to Signals.
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Doppelganger-finder finds multiple accounts (doppelgangers) of a user.
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
A MNIST-like fashion product database. Benchmark π
A WebGL accelerated JavaScript library for training and deploying ML models.
Notebooks for learning deep learning
Trax β Deep Learning with Clear Code and Speed
The aim of this project is to join together all the contribution that people around the world can offer to help everyone to overcome the devastating outbreak of COVID19
Random User-Agent middleware based on fake-useragent
β‘ A Fast, Extensible Progress Bar for Python and CLI
π Parameterize, execute, and analyze notebooks
π Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
ECMAScript parsing infrastructure for multipurpose analysis
The Sonar WebDriver Plugin is a static code analysis tool that helps following best practices for writing WebDriver tests
Google Auth Python Library
Imaging, analysis, and simulation software for radio interferometry
Automatically mock your HTTP interactions to simplify and speed up testing


