🐍 Hand-picked awesome Python libraries and frameworks, with an emphasis on data and machine learning, organised by category

dylanhogg (last update: Mar 04, 2024)

awesome-python

Interactive version: www.awesomepython.org

Updated 27 Feb 2024

Categories

  • Code quality - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (11 repos)
  • Crypto and Blockchain - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (12 repos)
  • Data - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (93 repos)
  • Debugging - Debugging and tracing tools (9 repos)
  • Diffusion Text to Image - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (35 repos)
  • Finance - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (31 repos)
  • Game Development - Game development tools, engines and libraries (6 repos)
  • GIS - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (27 repos)
  • Graph - Graphs and network libraries: network analysis, graph machine learning, visualisation (4 repos)
  • GUI - Graphical user interface libraries and toolkits (8 repos)
  • Jupyter - Jupyter, JupyterLab and Notebook tools, libraries and plugins (24 repos)
  • LLMs and ChatGPT - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover (202 repos)
  • Math and Science - Maths and science libraries (14 repos)
  • Machine Learning - General - General and classical machine learning libraries. See below for other sections covering specialised ML areas (150 repos)
  • Machine Learning - Deep Learning - Machine learning libraries that cross over with deep learning in some way (69 repos)
  • Machine Learning - Interpretability - Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training (13 repos)
  • Machine Learning - Ops - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (40 repos)
  • Machine Learning - Reinforcement - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF (21 repos)
  • Machine Learning - Time Series - Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics (17 repos)
  • Natural Language Processing - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (80 repos)
  • Packaging - Python packaging, dependency management and bundling (25 repos)
  • Pandas - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (23 repos)
  • Performance - Performance, parallelisation and low level libraries (28 repos)
  • Profiling - Memory and CPU/GPU profiling tools and libraries (11 repos)
  • Security - Security related libraries: vulnerability discovery, SQL injection, environment auditing (12 repos)
  • Simulation - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover (25 repos)
  • Study - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (51 repos)
  • Template - Template tools and libraries: cookiecutter repos, generators, quick-starts (8 repos)
  • Terminal - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (14 repos)
  • Testing - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (21 repos)
  • Typing - Typing libraries: static and run-time type checking, annotations (10 repos)
  • Utility - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (178 repos)
  • Vizualisation - Visualisation tools and libraries: application frameworks, 2D/3D plotting, dashboards, WebGL (30 repos)
  • Web - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (52 repos)

Code quality

Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.

  1. psf/black
    The uncompromising Python code formatter
    🔗 black.readthedocs.io/en/stable
    ⭐ 36,772

  2. astral-sh/ruff
    An extremely fast Python linter and code formatter, written in Rust.
    🔗 docs.astral.sh/ruff
    ⭐ 23,779

  3. google/yapf
    A formatter for Python files
    ⭐ 13,587

  4. pre-commit/pre-commit
    A framework for managing and maintaining multi-language pre-commit hooks.
    🔗 pre-commit.com
    ⭐ 11,660

  5. pycqa/isort
    A Python utility / library to sort imports.
    🔗 pycqa.github.io/isort
    ⭐ 6,239

  6. pycqa/pylint
    It's not just a linter that annoys you!
    🔗 pylint.readthedocs.io/en/latest
    ⭐ 5,037

  7. asottile/pyupgrade
    A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.
    ⭐ 3,216

  8. pycqa/flake8
    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
    🔗 flake8.pycqa.org
    ⭐ 3,186

  9. jendrikseipp/vulture
    Find dead Python code
    ⭐ 2,918

  10. sourcery-ai/sourcery
    Instant AI code reviews to speed up your pull requests
    🔗 sourcery.ai
    ⭐ 1,466

  11. akaihola/darker
    Apply black reformatting to Python files only in regions changed since a given commit. For a practical usage example, see the blog post at https://dev.to/akaihola/improving-python-code-incrementally-3f7a
    🔗 pypi.org/project/darker
    ⭐ 602

Crypto and Blockchain

Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.

  1. ccxt/ccxt
    A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
    🔗 docs.ccxt.com
    ⭐ 30,501

  2. freqtrade/freqtrade
    Free, open source crypto trading bot
    🔗 www.freqtrade.io
    ⭐ 24,429

  3. crytic/slither
    Static Analyzer for Solidity and Vyper
    🔗 blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework
    ⭐ 4,864

  4. ethereum/web3.py
    A python interface for interacting with the Ethereum blockchain and ecosystem.
    🔗 web3py.readthedocs.io
    ⭐ 4,643

  5. ethereum/consensus-specs
    Ethereum Proof-of-Stake Consensus Specifications
    ⭐ 3,353

  6. cyberpunkmetalhead/Binance-volatility-trading-bot
    This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins. If you like this project, consider donating through the Brave browser to allow me to continuously improve the script.
    ⭐ 3,314

  7. ethereum/py-evm
    A Python implementation of the Ethereum Virtual Machine
    🔗 py-evm.readthedocs.io/en/latest
    ⭐ 2,152

  8. bmoscon/cryptofeed
    Cryptocurrency Exchange Websocket Data Feed Handler
    ⭐ 2,007

  9. binance/binance-public-data
    Details on how to get Binance public data
    ⭐ 1,258

  10. ofek/bit
    Bitcoin made easy.
    🔗 ofek.dev/bit
    ⭐ 1,186

  11. man-c/pycoingecko
    Python wrapper for the CoinGecko API
    ⭐ 1,025

  12. palkeo/panoramix
    Ethereum decompiler
    ⭐ 730

Data

General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.

  1. scrapy/scrapy
    Scrapy, a fast high-level web crawling & scraping framework for Python.
    🔗 scrapy.org
    ⭐ 50,175

  2. apache/spark
    Apache Spark - A unified analytics engine for large-scale data processing
    🔗 spark.apache.org
    ⭐ 37,846

  3. getredash/redash
    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
    🔗 redash.io
    ⭐ 24,621

  4. jaidedai/EasyOCR
    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
    🔗 www.jaided.ai
    ⭐ 21,132

  5. mindsdb/mindsdb
    The middleware for building custom AI, enabling smarter organizations.
    🔗 mindsdb.com
    ⭐ 19,974

  6. joke2k/faker
    Faker is a Python package that generates fake data for you.
    🔗 faker.readthedocs.io
    ⭐ 16,867

  7. qdrant/qdrant
    Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
    🔗 qdrant.tech
    ⭐ 16,241

  8. binux/pyspider
    A Powerful Spider (Web Crawler) System in Python.
    🔗 docs.pyspider.org
    ⭐ 16,219

  9. twintproject/twint
    An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
    ⭐ 15,432

  10. airbytehq/airbyte
    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
    🔗 airbyte.com
    ⭐ 13,225

  11. apache/arrow
    Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
    🔗 arrow.apache.org
    ⭐ 13,165

  12. tiangolo/sqlmodel
    SQL databases in Python, designed for simplicity, compatibility, and robustness.
    🔗 sqlmodel.tiangolo.com
    ⭐ 12,311

  13. redis/redis-py
    Redis Python Client
    ⭐ 12,087

  14. chroma-core/chroma
    the AI-native open-source embedding database
    🔗 www.trychroma.com
    ⭐ 11,059

  15. coleifer/peewee
    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
    🔗 docs.peewee-orm.com
    ⭐ 10,660

  16. s0md3v/Photon
    Incredibly fast crawler designed for OSINT.
    ⭐ 10,367

  17. simonw/datasette
    An open source multi-tool for exploring and publishing data
    🔗 datasette.io
    ⭐ 8,715

  18. sqlalchemy/sqlalchemy
    The Database Toolkit for Python
    🔗 www.sqlalchemy.org
    ⭐ 8,430

  19. bigscience-workshop/petals
    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
    🔗 petals.dev
    ⭐ 8,404

  20. yzhao062/pyod
    A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
    🔗 pyod.readthedocs.io
    ⭐ 7,807

  21. sqlfluff/sqlfluff
    A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
    🔗 www.sqlfluff.com
    ⭐ 7,033

  22. gristlabs/grist-core
    Grist is the evolution of spreadsheets.
    🔗 www.getgrist.com
    ⭐ 5,965

  23. alirezamika/autoscraper
    A Smart, Automatic, Fast and Lightweight Web Scraper for Python
    ⭐ 5,807

  24. kaggle/kaggle-api
    Official Kaggle API
    ⭐ 5,785

  25. vi3k6i5/flashtext
    Extract Keywords from sentence or Replace keywords in sentences.
    ⭐ 5,512

  26. avaiga/taipy
    Turns Data and AI algorithms into production-ready web applications in no time.
    🔗 www.taipy.io
    ⭐ 5,491

  27. airbnb/knowledge-repo
    A next-generation curated knowledge sharing platform for data scientists and other technical professions.
    ⭐ 5,414

  28. madmaze/pytesseract
    A Python wrapper for Google Tesseract
    ⭐ 5,365

  29. tobymao/sqlglot
    Python SQL Parser and Transpiler
    🔗 sqlglot.com
    ⭐ 4,907

  30. facebookresearch/AugLy
    A data augmentations library for audio, image, text, and video.
    🔗 ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models
    ⭐ 4,873

  31. jazzband/tablib
    Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
    🔗 tablib.readthedocs.io
    ⭐ 4,482

  32. lk-geimfari/mimesis
    Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
    🔗 mimesis.name
    ⭐ 4,220

  33. amundsen-io/amundsen
    Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
    🔗 www.amundsen.io/amundsen
    ⭐ 4,206

  34. superduperdb/superduperdb
    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
    🔗 superduperdb.com
    ⭐ 4,136

  35. mongodb/mongo-python-driver
    PyMongo - the Official MongoDB Python driver
    🔗 pymongo.readthedocs.io
    ⭐ 4,009

  36. ibis-project/ibis
    Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
    🔗 ibis-project.org
    ⭐ 3,682

  37. andialbrecht/sqlparse
    A non-validating SQL parser module for Python
    ⭐ 3,527

  38. jmcnamara/XlsxWriter
    A Python module for creating Excel XLSX files.
    🔗 xlsxwriter.readthedocs.io
    ⭐ 3,428

  39. run-llama/llama-hub
    A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
    🔗 llamahub.ai
    ⭐ 3,337

  40. praw-dev/praw
    PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
    🔗 praw.readthedocs.io
    ⭐ 3,265

  41. deepchecks/deepchecks
    Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
    🔗 docs.deepchecks.com/stable
    ⭐ 3,244

  42. pyeve/cerberus
    Lightweight, extensible data validation library for Python
    🔗 python-cerberus.org
    ⭐ 3,085

  43. rom1504/img2dataset
    Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
    ⭐ 3,048

  44. pallets/itsdangerous
    Safely pass trusted data to untrusted environments and back.
    🔗 itsdangerous.palletsprojects.com
    ⭐ 2,803

  45. zoomeranalytics/xlwings
    xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
    🔗 www.xlwings.org
    ⭐ 2,792

  46. goldsmith/Wikipedia
    A Pythonic wrapper for the Wikipedia API
    🔗 wikipedia.readthedocs.org
    ⭐ 2,790

  47. datafold/data-diff
    Compare tables within or across databases
    🔗 docs.datafold.com
    ⭐ 2,749

  48. awslabs/amazon-redshift-utils
    Amazon Redshift Utils contains utilities, scripts and views which are useful in a Redshift environment
    ⭐ 2,690

  49. docarray/docarray
    Represent, send, store and search multimodal data
    🔗 docs.docarray.org
    ⭐ 2,662

  50. giskard-ai/giskard
    🐢 The testing framework for ML models, from tabular to LLMs
    🔗 docs.giskard.ai
    ⭐ 2,560

  51. pynamodb/PynamoDB
    A pythonic interface to Amazon's DynamoDB
    🔗 pynamodb.readthedocs.io
    ⭐ 2,348

  52. sqlalchemy/alembic
    A database migrations tool for SQLAlchemy.
    ⭐ 2,345

  53. kayak/pypika
    PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
    🔗 pypika.readthedocs.io/en/latest
    ⭐ 2,313

  54. lancedb/lancedb
    Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
    🔗 lancedb.github.io/lancedb
    ⭐ 2,262

  55. uqfoundation/dill
    serialize all of Python
    🔗 dill.rtfd.io
    ⭐ 2,087

  56. accenture/AmpliGraph
    Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
    ⭐ 2,057

  57. graphistry/pygraphistry
    PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
    ⭐ 2,007

  58. sdv-dev/SDV
    Synthetic data generation for tabular data
    🔗 docs.sdv.dev/sdv
    ⭐ 1,815

  59. samuelcolvin/arq
    Fast job queuing and RPC in python with asyncio and redis.
    🔗 arq-docs.helpmanual.io
    ⭐ 1,797

  60. uber/petastorm
    Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
    ⭐ 1,724

  61. rapidai/RapidOCR
    A cross platform OCR Library based on PaddleOCR & OnnxRuntime & OpenVINO.
    🔗 rapidai.github.io/rapidocrdocs/docs
    ⭐ 1,716

  62. sfu-db/connector-x
    Fastest library to load data from DB to DataFrames in Rust and Python
    🔗 sfu-db.github.io/connector-x/intro.html
    ⭐ 1,695

  63. aio-libs/aiomysql
    aiomysql is a library for accessing a MySQL database from the asyncio
    🔗 aiomysql.rtfd.io
    ⭐ 1,682

  64. agronholm/sqlacodegen
    Automatic model code generator for SQLAlchemy
    ⭐ 1,646

  65. simple-salesforce/simple-salesforce
    A very simple Salesforce.com REST API client for Python
    ⭐ 1,558

  66. milvus-io/bootcamp
    Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
    🔗 milvus.io
    ⭐ 1,548

  67. collerek/ormar
    python async orm with fastapi in mind and pydantic validation
    🔗 collerek.github.io/ormar
    ⭐ 1,529

  68. aminalaee/sqladmin
    SQLAlchemy Admin for FastAPI and Starlette
    🔗 aminalaee.dev/sqladmin
    ⭐ 1,455

  69. sdispater/orator
    The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
    🔗 orator-orm.com
    ⭐ 1,421

  70. simonw/sqlite-utils
    Python CLI utility and library for manipulating SQLite databases
    🔗 sqlite-utils.datasette.io
    ⭐ 1,415

  71. mchong6/JoJoGAN
    Official PyTorch repo for JoJoGAN: One Shot Face Stylization
    ⭐ 1,399

  72. pathwaycom/pathway
    Pathway is a high-throughput, low-latency data processing framework that handles live data & streaming for you. Made with ❤️ for Python & ML/AI developers.
    🔗 pathway.com
    ⭐ 1,363

  73. aio-libs/aiopg
    aiopg is a library for accessing a PostgreSQL database from the asyncio
    🔗 aiopg.readthedocs.io
    ⭐ 1,360

  74. eleutherai/the-pile
    The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.
    ⭐ 1,356

  75. dlt-hub/dlt
    data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
    🔗 dlthub.com/docs
    ⭐ 1,317

  76. zarr-developers/zarr-python
    An implementation of chunked, compressed, N-dimensional arrays for Python.
    🔗 zarr.readthedocs.io
    ⭐ 1,298

  77. google/tensorstore
    Library for reading and writing large multi-dimensional arrays.
    🔗 google.github.io/tensorstore
    ⭐ 1,257

  78. ydataai/ydata-synthetic
    Synthetic data generators for tabular and time-series data
    🔗 docs.synthetic.ydata.ai
    ⭐ 1,223

  79. pytorch/data
    A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
    ⭐ 1,053

  80. uber/fiber
    Distributed Computing for AI Made Simple
    🔗 uber.github.io/fiber
    ⭐ 1,041

  81. brettkromkamp/contextualise
    Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
    🔗 contextualise.dev
    ⭐ 1,026

  82. eliasdabbas/advertools
    advertools - online marketing productivity and analysis tools
    🔗 advertools.readthedocs.io
    ⭐ 999

  83. aio-libs/aiocache
    Asyncio cache manager for redis, memcached and memory
    🔗 aiocache.readthedocs.io
    ⭐ 993

  84. intake/intake
    Intake is a lightweight package for finding, investigating, loading and disseminating data.
    🔗 intake.readthedocs.io
    ⭐ 971

  85. huggingface/datatrove
    Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
    ⭐ 966

  86. scikit-hep/awkward
    Manipulate JSON-like data with NumPy-like idioms.
    🔗 awkward-array.org
    ⭐ 773

  87. koaning/human-learn
    Natural Intelligence is still a pretty good idea.
    🔗 koaning.github.io/human-learn
    ⭐ 768

  88. hyperqueryhq/whale
    🐳 The stupidly simple CLI workspace for your data warehouse.
    🔗 rsyi.gitbook.io/whale
    ⭐ 725

  89. googleapis/python-bigquery
    Python Client for Google BigQuery
    ⭐ 695

  90. duckdb/dbt-duckdb
    dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
    ⭐ 673

  91. mcfunley/pugsql
    A HugSQL-inspired database library for Python
    🔗 pugsql.org
    ⭐ 656

  92. goccy/bigquery-emulator
    BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development.
    ⭐ 650

  93. dgarnitz/vectorflow
    VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
    🔗 www.getvectorflow.com
    ⭐ 611

Debugging

Debugging and tracing tools.

  1. cool-rr/PySnooper
    Never use print for debugging again
    ⭐ 16,181

  2. gruns/icecream
    🍦 Never use print() to debug again.
    ⭐ 8,207

  3. shobrook/rebound
    Command-line tool that instantly fetches Stack Overflow results when an exception is thrown
    ⭐ 4,057

  4. inducer/pudb
    Full-screen console debugger for Python
    🔗 documen.tician.de/pudb
    ⭐ 2,830

  5. gotcha/ipdb
    Integration of IPython pdb
    ⭐ 1,796

  6. alexmojaki/heartrate
    Simple real time visualisation of the execution of a Python program.
    ⭐ 1,708

  7. alexmojaki/birdseye
    Graphical Python debugger which lets you easily view the values of all evaluated expressions
    🔗 birdseye.readthedocs.io
    ⭐ 1,621

  8. alexmojaki/snoop
    A powerful set of Python debugging tools, based on PySnooper
    ⭐ 1,167

  9. samuelcolvin/python-devtools
    Dev tools for python
    🔗 python-devtools.helpmanual.io
    ⭐ 917

Diffusion Text to Image

Text-to-image diffusion model libraries, tools and apps for generating images from natural language.

  1. automatic1111/stable-diffusion-webui
    Stable Diffusion web UI
    ⭐ 121,808

  2. compvis/stable-diffusion
    A latent text-to-image diffusion model
    🔗 ommer-lab.com/research/latent-diffusion-models
    ⭐ 63,611

  3. stability-ai/stablediffusion
    High-Resolution Image Synthesis with Latent Diffusion Models
    ⭐ 34,541

  4. lllyasviel/ControlNet
    Let us control diffusion models!
    ⭐ 26,648

  5. comfyanonymous/ComfyUI
    The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
    ⭐ 25,860

  6. huggingface/diffusers
    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
    🔗 huggingface.co/docs/diffusers
    ⭐ 21,047

  7. invoke-ai/InvokeAI
    InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multi...
    🔗 invoke-ai.github.io/invokeai
    ⭐ 20,664

  8. apple/ml-stable-diffusion
    Stable Diffusion with Core ML on Apple Silicon
    ⭐ 15,790

  9. borisdayma/dalle-mini
    DALL·E Mini - Generate images from a text prompt
    🔗 www.craiyon.com
    ⭐ 14,529

  10. divamgupta/diffusionbee-stable-diffusion-ui
    Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
    🔗 diffusionbee.com
    ⭐ 11,637

  11. lucidrains/DALLE2-pytorch
    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
    ⭐ 10,643

  12. compvis/latent-diffusion
    High-Resolution Image Synthesis with Latent Diffusion Models
    ⭐ 9,893

  13. instantid/InstantID
    InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
    🔗 instantid.github.io
    ⭐ 7,927

  14. ashawkey/stable-dreamfusion
    Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
    ⭐ 7,562

  15. carson-katri/dream-textures
    Stable Diffusion built-in to Blender
    ⭐ 7,468

  16. xavierxiao/Dreambooth-Stable-Diffusion
    Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
    ⭐ 7,353

  17. facebookresearch/dinov2
    PyTorch code and models for the DINOv2 self-supervised learning method.
    ⭐ 7,321

  18. openai/consistency_models
    Official repo for consistency models.
    ⭐ 5,819

  19. timothybrooks/instruct-pix2pix
    PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo.
    ⭐ 5,743

  20. idea-research/GroundingDINO
    Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
    🔗 arxiv.org/abs/2303.05499
    ⭐ 4,339

  21. nateraw/stable-diffusion-videos
    Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
    ⭐ 4,016

  22. salesforce/BLIP
    PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
    ⭐ 3,991

  23. jina-ai/discoart
    🪩 Create Disco Diffusion artworks in one line
    ⭐ 3,837

  24. lkwq007/stablediffusion-infinity
    Outpainting with Stable Diffusion on an infinite canvas
    ⭐ 3,755

  25. openai/glide-text2im
    GLIDE: a diffusion-based text-conditional image synthesis model
    ⭐ 3,415

  26. mlc-ai/web-stable-diffusion
    Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
    🔗 mlc.ai/web-stable-diffusion
    ⭐ 3,329

  27. saharmor/dalle-playground
    A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)
    ⭐ 2,755

  28. openai/improved-diffusion
    Release for Improved Denoising Diffusion Probabilistic Models
    ⭐ 2,579

  29. stability-ai/stability-sdk
    SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
    🔗 platform.stability.ai
    ⭐ 2,387

  30. divamgupta/stable-diffusion-tensorflow
    Stable Diffusion in TensorFlow / Keras
    ⭐ 1,556

  31. coyote-a/ultimate-upscale-for-automatic1111
    Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI
    ⭐ 1,392

  32. nvlabs/prismer
    The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
    🔗 shikun.io/projects/prismer
    ⭐ 1,249

  33. chenyangqiqi/FateZero
    [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
    🔗 fate-zero-edit.github.io
    ⭐ 1,002

  34. thereforegames/unprompted
    Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.
    ⭐ 725

  35. sharonzhou/long_stable_diffusion
    Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)
    ⭐ 672

Finance

Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.

  1. openbb-finance/OpenBBTerminal
    Investment Research for Everyone, Everywhere.
    🔗 my.openbb.co/app/terminal
    ⭐ 25,618

  2. quantopian/zipline
    Zipline, a Pythonic Algorithmic Trading Library
    🔗 www.zipline.io
    ⭐ 16,901

  3. microsoft/qlib
    Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, ...
    🔗 qlib.readthedocs.io/en/latest
    ⭐ 13,455

  4. mementum/backtrader
    Python Backtesting library for trading strategies
    🔗 www.backtrader.com
    ⭐ 12,555

  5. ranaroussi/yfinance
    Download market data from Yahoo! Finance's API
    🔗 aroussi.com/post/python-yahoo-finance
    ⭐ 11,315

  6. ai4finance-foundation/FinGPT
    Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.
    🔗 discord.gg/trsr8sxpw5
    ⭐ 10,681

  7. ai4finance-foundation/FinRL
    FinRL: Financial Reinforcement Learning. 🔥
    🔗 discord.gg/trsr8sxpw5
    ⭐ 8,786

  8. ta-lib/ta-lib-python
    Python wrapper for TA-Lib (http://ta-lib.org/).
    🔗 ta-lib.github.io/ta-lib-python
    ⭐ 8,763

  9. quantconnect/Lean
    Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
    🔗 lean.io
    ⭐ 8,453

  10. quantopian/pyfolio
    Portfolio and risk analytics in Python
    🔗 quantopian.github.io/pyfolio
    ⭐ 5,344

  11. kernc/backtesting.py
    🔎 📈 🐍 💰 Backtest trading strategies in Python.
    🔗 kernc.github.io/backtesting.py
    ⭐ 4,588

  12. twopirllc/pandas-ta
    Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
    🔗 twopirllc.github.io/pandas-ta
    ⭐ 4,511

  13. gbeced/pyalgotrade
    Python Algorithmic Trading Library
    🔗 gbeced.github.io/pyalgotrade
    ⭐ 4,255

  14. google/tf-quant-finance
    High-performance TensorFlow library for quantitative finance.
    ⭐ 4,198

  15. ranaroussi/quantstats
    Portfolio analytics for quants, written in Python
    ⭐ 4,004

  16. borisbanushev/stockpredictionai
    In this notebook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ...
    ⭐ 3,891

  17. polakowo/vectorbt
    Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
    🔗 vectorbt.dev
    ⭐ 3,573

  18. cuemacro/finmarketpy
    Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
    ๐Ÿ”— www.cuemacro.com
    โญ 3,300

  19. matplotlib/mplfinance
    Financial Markets Data Visualization using Matplotlib
    ๐Ÿ”— pypi.org/project/mplfinance
    โญ 3,224

  20. quantopian/alphalens
    Performance analysis of predictive (alpha) stock factors
    ๐Ÿ”— quantopian.github.io/alphalens
    โญ 2,985

  21. zvtvz/zvt
    modular quant framework.
    ๐Ÿ”— zvt.readthedocs.io/en/latest
    โญ 2,838

  22. goldmansachs/gs-quant
    Python toolkit for quantitative finance
    ๐Ÿ”— developer.gs.com/discover/products/gs-quant
    โญ 2,338

  23. robcarver17/pysystemtrade
    Systematic Trading in python
    โญ 2,306

  24. quantopian/research_public
    Quantitative research and educational materials
    ๐Ÿ”— www.quantopian.com/lectures
    โญ 2,279

  25. pmorissette/bt
    bt - flexible backtesting for Python
    ๐Ÿ”— pmorissette.github.io/bt
    โญ 1,930

  26. blankly-finance/blankly
    ๐Ÿš€ ๐Ÿ’ธ Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
    ๐Ÿ”— package.blankly.finance
    โญ 1,903

  27. domokane/FinancePy
    A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives.
    ๐Ÿ”— financepy.com
    โญ 1,785

  28. pmorissette/ffn
    ffn - a financial function library for Python
    ๐Ÿ”— pmorissette.github.io/ffn
    โญ 1,727

  29. cuemacro/findatapy
    Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc.
    โญ 1,530

  30. quantopian/empyrical
    Common financial risk and performance metrics. Used by zipline and pyfolio.
    ๐Ÿ”— quantopian.github.io/empyrical
    โญ 1,201

  31. idanya/algo-trader
    Trading bot with support for realtime trading, backtesting, custom strategies and much more.
    โญ 733

Game Development

Game development tools, engines and libraries.

  1. kitao/pyxel
    A retro game engine for Python
    ⭐ 12,957

  2. pygame/pygame
    🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
    🔗 www.pygame.org
    ⭐ 6,767

  3. panda3d/panda3d
    Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
    🔗 www.panda3d.org
    ⭐ 4,181

  4. pokepetter/ursina
    A game engine powered by python and panda3d.
    🔗 pokepetter.github.io/ursina
    ⭐ 2,042

  5. pyglet/pyglet
    pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
    🔗 pyglet.org
    ⭐ 1,703

  6. pythonarcade/arcade
    Easy to use Python library for creating 2D arcade games.
    🔗 arcade.academy
    ⭐ 1,593

GIS

Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.

  1. domlysz/BlenderGIS
    Blender addons to make the bridge between Blender and geographic data
    ⭐ 7,079

  2. python-visualization/folium
    Python Data. Leaflet.js Maps.
    🔗 python-visualization.github.io/folium
    ⭐ 6,591

  3. gboeing/osmnx
    OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
    🔗 osmnx.readthedocs.io
    ⭐ 4,581

  4. osgeo/gdal
    GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
    🔗 gdal.org
    ⭐ 4,337

  5. geopandas/geopandas
    Python tools for geographic data
    🔗 geopandas.org
    ⭐ 4,066

  6. shapely/shapely
    Manipulation and analysis of geometric objects
    🔗 shapely.readthedocs.io/en/stable
    ⭐ 3,592

  7. holoviz/datashader
    Quickly and accurately render even the largest data.
    🔗 datashader.org
    ⭐ 3,160

  8. giswqs/geemap
    A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
    🔗 geemap.org
    ⭐ 3,092

  9. opengeos/leafmap
    A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
    🔗 leafmap.org
    ⭐ 2,858

  10. opengeos/segment-geospatial
    A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
    🔗 samgeo.gishub.org
    ⭐ 2,552

  11. google/earthengine-api
    Python and JavaScript bindings for calling the Earth Engine API.
    ⭐ 2,476

  12. microsoft/torchgeo
    TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
    🔗 www.osgeo.org/projects/torchgeo
    ⭐ 2,119

  13. rasterio/rasterio
    Rasterio reads and writes geospatial raster datasets
    🔗 rasterio.readthedocs.io
    ⭐ 2,091

  14. mcordts/cityscapesScripts
    README and scripts for the Cityscapes Dataset
    ⭐ 2,063

  15. azavea/raster-vision
    An open source library and framework for deep learning on satellite and aerial imagery.
    🔗 docs.rastervision.io
    ⭐ 1,966

  16. plant99/felicette
    Satellite imagery for dummies.
    ⭐ 1,814

  17. apache/sedona
    A cluster computing framework for processing large-scale geospatial data
    🔗 sedona.apache.org
    ⭐ 1,725

  18. jupyter-widgets/ipyleaflet
    A Jupyter - Leaflet.js bridge
    🔗 ipyleaflet.readthedocs.io
    ⭐ 1,440

  19. gboeing/osmnx-examples
    Gallery of OSMnx tutorials, usage examples, and feature demonstrations.
    🔗 osmnx.readthedocs.io
    ⭐ 1,411

  20. microsoft/GlobalMLBuildingFootprints
    Worldwide building footprints derived from satellite imagery
    ⭐ 1,215

  21. residentmario/geoplot
    High-level geospatial data visualization library for Python.
    🔗 residentmario.github.io/geoplot/index.html
    ⭐ 1,107

  22. anitagraser/movingpandas
    Movement trajectory classes and functions built on top of GeoPandas
    🔗 movingpandas.org
    ⭐ 1,099

  23. sentinel-hub/eo-learn
    Earth observation processing framework for machine learning in Python
    🔗 eo-learn.readthedocs.io/en/latest
    ⭐ 1,067

  24. makepath/xarray-spatial
    Raster-based Spatial Analytics for Python
    🔗 xarray-spatial.org
    ⭐ 757

  25. osgeo/grass
    GRASS GIS - free and open-source geospatial processing engine
    🔗 grass.osgeo.org
    ⭐ 734

  26. scikit-mobility/scikit-mobility
    scikit-mobility: mobility analysis in Python
    🔗 scikit-mobility.github.io/scikit-mobility
    ⭐ 680

  27. developmentseed/titiler
    Build your own Raster dynamic map tile services
    🔗 developmentseed.org/titiler
    ⭐ 655

Graph

Graphs and network libraries: network analysis, graph machine learning, visualisation.

  1. networkx/networkx
    Network Analysis in Python
    🔗 networkx.org
    ⭐ 13,917

  2. stellargraph/stellargraph
    StellarGraph - Machine Learning on Graphs
    🔗 stellargraph.readthedocs.io
    ⭐ 2,849

  3. westhealth/pyvis
    Python package for creating and visualizing interactive network graphs.
    🔗 pyvis.readthedocs.io/en/latest
    ⭐ 878

  4. rampasek/GraphGPS
    Recipe for a General, Powerful, Scalable Graph Transformer
    ⭐ 558

GUI

Graphical user interface libraries and toolkits.

  1. pysimplegui/PySimpleGUI
    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
    🔗 www.pysimplegui.com
    ⭐ 13,024

  2. hoffstadt/DearPyGui
    Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
    🔗 dearpygui.readthedocs.io/en/latest
    ⭐ 11,900

  3. parthjadhav/Tkinter-Designer
    An easy and fast way to create a Python GUI 🐍
    ⭐ 7,951

  4. r0x0r/pywebview
    Build GUI for your Python program with JavaScript, HTML, and CSS
    🔗 pywebview.flowrl.com
    ⭐ 4,061

  5. beeware/toga
    A Python native, OS native GUI toolkit.
    🔗 toga.readthedocs.io/en/latest
    ⭐ 4,038

  6. samuelcolvin/FastUI
    FastUI is a new way to build web application user interfaces defined by declarative Python code.
    🔗 fastui-demo.onrender.com
    ⭐ 3,655

  7. dddomodossola/remi
    Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.
    ⭐ 3,439

  8. wxwidgets/Phoenix
    wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
    🔗 wxpython.org
    ⭐ 2,143

Jupyter

Jupyter, JupyterLab and Notebook tools, libraries and plugins.

  1. jupyterlab/jupyterlab
    JupyterLab computational environment.
    🔗 jupyterlab.readthedocs.io
    ⭐ 13,621

  2. jupyter/notebook
    Jupyter Interactive Notebook
    🔗 jupyter-notebook.readthedocs.io
    ⭐ 10,959

  3. mwouts/jupytext
    Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
    🔗 jupytext.readthedocs.io
    ⭐ 6,347

  4. nteract/papermill
    📚 Parameterize, execute, and analyze notebooks
    🔗 papermill.readthedocs.io/en/latest
    ⭐ 5,543

  5. connorferster/handcalcs
    Python library for converting Python calculations into rendered latex.
    ⭐ 5,322

  6. voila-dashboards/voila
    Voilà turns Jupyter notebooks into standalone web applications
    🔗 voila.readthedocs.io
    ⭐ 5,114

  7. executablebooks/jupyter-book
    Create beautiful, publication-quality books and documents from computational content.
    🔗 jupyterbook.org
    ⭐ 3,628

  8. jupyterlite/jupyterlite
    Wasm powered Jupyter running in the browser 💡
    🔗 jupyterlite.rtfd.io/en/stable/try/lab
    ⭐ 3,513

  9. jupyterlab/jupyterlab-desktop
    JupyterLab desktop application, based on Electron.
    ⭐ 3,252

  10. quantopian/qgrid
    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
    ⭐ 3,011

  11. jupyter-widgets/ipywidgets
    Interactive Widgets for the Jupyter Notebook
    🔗 ipywidgets.readthedocs.io
    ⭐ 2,998

  12. jupyterlab/jupyter-ai
    A generative AI extension for JupyterLab
    🔗 jupyter-ai.readthedocs.io
    ⭐ 2,659

  13. jupyter/nbdime
    Tools for diffing and merging of Jupyter notebooks.
    🔗 nbdime.readthedocs.io
    ⭐ 2,567

  14. jupyter/nbviewer
    nbconvert as a web service: Render Jupyter Notebooks as static web pages
    🔗 nbviewer.jupyter.org
    ⭐ 2,149

  15. mito-ds/mito
    The mitosheet package, trymito.io, and other public Mito code.
    🔗 trymito.io
    ⭐ 2,142

  16. maartenbreddels/ipyvolume
    3d plotting for Python in the Jupyter notebook based on IPython widgets using WebGL
    ⭐ 1,896

  17. jupyter-lsp/jupyterlab-lsp
    Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
    ⭐ 1,683

  18. jupyter/nbconvert
    Jupyter Notebook Conversion
    🔗 nbconvert.readthedocs.io
    ⭐ 1,633

  19. jupyter/nbgrader
    A system for assigning and grading notebooks
    🔗 nbgrader.readthedocs.io
    ⭐ 1,246

  20. nbqa-dev/nbQA
    Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
    🔗 nbqa.readthedocs.io/en/latest/index.html
    ⭐ 940

  21. vizzuhq/ipyvizzu
    Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
    🔗 ipyvizzu.vizzuhq.com
    ⭐ 908

  22. aws/graph-notebook
    Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
    🔗 github.com/aws/graph-notebook
    ⭐ 666

  23. linealabs/lineapy
    Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
    🔗 lineapy.org
    ⭐ 648

  24. koaning/drawdata
    Draw datasets from within Jupyter.
    🔗 calmcode.io/labs/drawdata.html
    ⭐ 644

LLMs and ChatGPT

Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover.

  1. torantulino/AutoGPT
    AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
    🔗 agpt.co
    ⭐ 157,985

  2. hwchase17/langchain
    🦜🔗 Build context-aware reasoning applications
    🔗 python.langchain.com
    ⭐ 76,928

  3. nomic-ai/gpt4all
    gpt4all: run open-source LLMs anywhere
    🔗 gpt4all.io
    ⭐ 61,381

  4. xtekky/gpt4free
    The official gpt4free repository | various collection of powerful language models
    🔗 discord.gg/xfybzpxph5
    ⭐ 53,038

  5. ggerganov/llama.cpp
    Port of Facebook's LLaMA model in C/C++
    ⭐ 51,019

  6. facebookresearch/llama
    Inference code for LLaMA models
    ⭐ 49,947

  7. antonosika/gpt-engineer
    Specify what you want it to build, the AI asks for clarification, and then builds it.
    ⭐ 49,143

  8. imartinez/privateGPT
    Interact with your documents using the power of GPT, 100% privately, no data leaks
    🔗 docs.privategpt.dev
    ⭐ 47,686

  9. killianlucas/open-interpreter
    A natural language interface for computers
    🔗 openinterpreter.com
    ⭐ 40,594

  10. thudm/ChatGLM-6B
    ChatGLM-6B: An Open Bilingual Dialogue Language Model
    ⭐ 37,842

  11. hpcaitech/ColossalAI
    Making large AI models cheaper, faster and more accessible
    🔗 www.colossalai.org
    ⭐ 36,398

  12. laion-ai/Open-Assistant
    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
    🔗 open-assistant.io
    ⭐ 36,317

  13. moymix/TaskMatrix
    Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.
    ⭐ 34,449

  14. geekan/MetaGPT
    🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
    🔗 deepwisdom.ai
    ⭐ 34,383

  15. oobabooga/text-generation-webui
    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
    ⭐ 33,030

  16. lm-sys/FastChat
    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
    ⭐ 31,737

  17. karpathy/nanoGPT
    The simplest, fastest repository for training/finetuning medium-sized GPTs.
    ⭐ 29,333

  18. tatsu-lab/stanford_alpaca
    Code and documentation to train Stanford's Alpaca models, and generate the data.
    🔗 crfm.stanford.edu/2023/03/13/alpaca.html
    ⭐ 28,343

  19. jerryjliu/llama_index
    LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
    🔗 docs.llamaindex.ai
    ⭐ 28,269

  20. vision-cair/MiniGPT-4
    Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
    🔗 minigpt-4.github.io
    ⭐ 24,502

  21. microsoft/JARVIS
    JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
    ⭐ 22,682

  22. microsoft/autogen
    Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
    🔗 microsoft.github.io/autogen
    ⭐ 21,993

  23. pythagora-io/gpt-pilot
    Dev tool that writes scalable apps from scratch while the developer oversees the implementation
    ⭐ 21,461

  24. openai/gpt-2
    Code for the paper "Language Models are Unsupervised Multitask Learners"
    🔗 openai.com/blog/better-language-models
    ⭐ 20,639

  25. openai/chatgpt-retrieval-plugin
    The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
    ⭐ 20,588

  26. yoheinakajima/babyagi
    GPT-4 powered task-driven autonomous agent
    ⭐ 18,565

  27. tloen/alpaca-lora
    Instruct-tune LLaMA on consumer hardware
    ⭐ 17,895

  28. karpathy/minGPT
    A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
    ⭐ 17,788

  29. rasahq/rasa
    💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
    🔗 rasa.com/docs/rasa
    ⭐ 17,618

  30. microsoft/semantic-kernel
    Integrate cutting-edge LLM technology quickly and easily into your apps
    🔗 aka.ms/semantic-kernel
    ⭐ 16,831

  31. guidance-ai/guidance
    A guidance language for controlling large language models.
    ⭐ 16,280

  32. mlc-ai/mlc-llm
    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
    🔗 llm.mlc.ai/docs
    ⭐ 15,667

  33. thudm/ChatGLM2-6B
    ChatGLM2-6B: An Open Bilingual Chat LLM
    ⭐ 15,081

  34. vllm-project/vllm
    A high-throughput and memory-efficient inference and serving engine for LLMs
    🔗 docs.vllm.ai
    ⭐ 14,894

  35. logspace-ai/langflow
    ⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
    🔗 www.langflow.org
    ⭐ 14,601

  36. karpathy/llama2.c
    Inference Llama 2 in one file of pure C
    ⭐ 14,299

  37. mayooear/gpt4-pdf-chatbot-langchain
    GPT4 & LangChain Chatbot for large PDF docs
    🔗 www.youtube.com/watch?v=ih9pbgvvoo4
    ⭐ 14,215

  38. haotian-liu/LLaVA
    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
    🔗 llava.hliu.cc
    ⭐ 14,019

  39. fauxpilot/fauxpilot
    FauxPilot - an open-source alternative to GitHub Copilot server
    ⭐ 14,000

  40. transformeroptimus/SuperAGI
    <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
    🔗 superagi.com
    ⭐ 13,799

  41. facebookresearch/codellama
    Inference code for CodeLlama models
    ⭐ 13,533

  42. openai/evals
    Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
    ⭐ 13,283

  43. deepset-ai/haystack
    🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conv...
    🔗 haystack.deepset.ai
    ⭐ 12,728

  44. idea-research/Grounded-Segment-Anything
    Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
    🔗 arxiv.org/abs/2401.14159
    ⭐ 12,652

  45. huggingface/peft
    🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
    🔗 huggingface.co/docs/peft
    ⭐ 12,588

  46. hiyouga/LLaMA-Factory
    Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
    ⭐ 11,777

  48. openlmlab/MOSS
    An open-source tool-augmented conversational language model from Fudan University
    🔗 txsun1997.github.io/blogs/moss.html
    ⭐ 11,736

  49. smol-ai/developer
    the first library to let you embed a developer agent in your own app!
    🔗 twitter.com/smolmodels
    ⭐ 11,501

  50. blinkdl/RWKV-LM
    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
    ⭐ 11,083

  51. paddlepaddle/PaddleNLP
    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
    🔗 paddlenlp.readthedocs.io
    ⭐ 11,003

  52. databrickslabs/dolly
    Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
    🔗 www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html
    ⭐ 10,720

  53. h2oai/h2ogpt
    Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
    🔗 h2o.ai
    ⭐ 9,767

  54. dao-ailab/flash-attention
    Fast and memory-efficient exact attention
    ⭐ 9,586

  55. blinkdl/ChatRWKV
    ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
    ⭐ 9,155

  56. shishirpatil/gorilla
    Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
    🔗 gorilla.cs.berkeley.edu
    ⭐ 8,982

  57. artidoro/qlora
    QLoRA: Efficient Finetuning of Quantized LLMs
    🔗 arxiv.org/abs/2305.14314
    ⭐ 8,903

  58. mlc-ai/web-llm
    Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
    🔗 mlc.ai/web-llm
    ⭐ 8,598

  59. microsoft/LoRA
    Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
    🔗 arxiv.org/abs/2106.09685
    ⭐ 8,224

  60. eleutherai/gpt-neo
    An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
    🔗 www.eleuther.ai
    ⭐ 8,111

  61. joaomdmoura/crewAI
    Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
    🔗 crewai.io
    ⭐ 7,832

  62. nvidia/Megatron-LM
    Ongoing research training transformer models at scale
    ⭐ 7,813

  63. embedchain/embedchain
    The Open Source RAG framework
    🔗 docs.embedchain.ai
    ⭐ 7,747

  64. optimalscale/LMFlow
    An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
    🔗 optimalscale.github.io/lmflow
    ⭐ 7,631

  65. thudm/CodeGeeX
    CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
    🔗 codegeex.cn
    ⭐ 7,603

  66. mistralai/mistral-src
    Reference implementation of Mistral AI 7B v0.1 model.
    🔗 mistral.ai
    ⭐ 7,581

  67. thudm/GLM-130B
    GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
    ⭐ 7,532

  68. facebookresearch/llama-recipes
    Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcas...
    ⭐ 7,504

  69. apple/ml-ferret
    Ferret: Refer and Ground Anything Anywhere at Any Granularity
    ⭐ 7,396

  70. lvwerra/trl
    Train transformer language models with reinforcement learning.
    🔗 hf.co/docs/trl
    ⭐ 7,392

  71. microsoft/promptflow
    Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
    🔗 microsoft.github.io/promptflow
    ⭐ 7,359

  72. lianjiatech/BELLE
    BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
    ⭐ 7,271

  73. openlm-research/open_llama
    OpenLLaMA: An Open Reproduction of LLaMA
    ⭐ 7,084

  74. assafelovic/gpt-researcher
    GPT based autonomous agent that does online comprehensive research on any given topic
    🔗 tavily.com
    ⭐ 7,042

  75. huggingface/text-generation-inference
    Large Language Model Text Generation Inference
    🔗 hf.co/docs/text-generation-inference
    ⭐ 6,994

  76. plachtaa/VALL-E-X
    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
    ⭐ 6,868

  77. bigcode-project/starcoder
    Home of StarCoder: fine-tuning & inference!
    ⭐ 6,867

  78. sweepai/sweep
    Sweep: AI-powered Junior Developer for small features and bug fixes.
    🔗 sweep.dev
    ⭐ 6,764

  79. stanfordnlp/dspy
    DSPy: The framework for programming, not prompting, foundation models
    ⭐ 6,613

  80. sjtu-ipads/PowerInfer
    High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
    ⭐ 6,564

  81. bhaskatripathi/pdfGPT
    PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files into a chatbot!
    🔗 bhaskartripathi-pdfgpt-turbo.hf.space
    ⭐ 6,533

  82. eleutherai/gpt-neox
    An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
    ⭐ 6,413

  83. jzhang38/TinyLlama
    The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
    ⭐ 6,126

  84. zilliztech/GPTCache
    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
    🔗 gptcache.readthedocs.io
    ⭐ 6,113

  85. nat/openplayground
    An LLM playground you can run on your laptop
    ⭐ 5,990

  86. mit-han-lab/streaming-llm
    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks
    🔗 arxiv.org/abs/2309.17453
    ⭐ 5,956

  87. langchain-ai/opengpts
    An open source effort to create a similar experience to OpenAI's GPTs and Assistants API.
    ⭐ 5,650

  88. lightning-ai/lit-llama
    Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
    ⭐ 5,612

  89. abetlen/llama-cpp-python
    Python bindings for llama.cpp
    🔗 llama-cpp-python.readthedocs.io
    ⭐ 5,597

  90. run-llama/rags
    Build ChatGPT over your data, all with natural language
    ⭐ 5,525

  91. berriai/litellm
    Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
    🔗 docs.litellm.ai/docs
    ⭐ 5,500

  92. danielmiessler/fabric
    fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
    🔗 danielmiessler.com/p/fabric-origin-story
    ⭐ 5,481

  93. vaibhavs10/insanely-fast-whisper
    An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by 🤗 Transformers, Optimum & flash-attn
    ⭐ 5,479

  94. dsdanielpark/Bard-API
    The unofficial Python package that returns responses from Google Bard through a cookie value.
    🔗 pypi.org/project/bardapi
    ⭐ 5,399

  95. skypilot-org/skypilot
    SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution, all with a simple interface.
    🔗 skypilot.readthedocs.io
    ⭐ 5,171

  96. lightning-ai/lit-gpt
    Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
    ⭐ 5,017

  97. karpathy/minbpe
    Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
    ⭐ 4,995

  98. minedojo/Voyager
    An Open-Ended Embodied Agent with Large Language Models
    🔗 voyager.minedojo.org
    ⭐ 4,894

  99. pytorch-labs/gpt-fast
    Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
    ⭐ 4,702

  100. chainlit/chainlit
    Build Conversational AI in minutes ⚡️
    🔗 docs.chainlit.io
    ⭐ 4,681

  101. microsoft/promptbase
    promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models.
    ⭐ 4,679

  102. langchain-ai/chat-langchain
    Locally hosted chatbot specifically focused on question answering over the LangChain documentation
    🔗 chat.langchain.com
    ⭐ 4,399

  103. microsoft/BioGPT
    Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining
    ⭐ 4,189

  104. togethercomputer/RedPajama-Data
    The RedPajama-Data repository contains code for preparing large datasets for training large language models.
    ⭐ 4,152

  105. openbmb/ToolBench
    [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
    🔗 openbmb.github.io/toolbench
    ⭐ 4,114

  106. eleutherai/lm-evaluation-harness
    A framework for few-shot evaluation of language models.
    🔗 www.eleuther.ai
    ⭐ 4,028

  107. kyegomez/tree-of-thoughts
    Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
    🔗 discord.gg/qutxnk2nmf
    ⭐ 3,841

  108. ravenscroftj/turbopilot
    Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU
    ⭐ 3,838

  109. instruction-tuning-with-gpt-4/GPT-4-LLM
    Instruction Tuning with GPT-4
    🔗 instruction-tuning-with-gpt-4.github.io
    ⭐ 3,824

  110. mnotgod96/AppAgent
    AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
    🔗 appagent-official.github.io
    ⭐ 3,756

  111. 1rgs/jsonformer
    A Bulletproof Way to Generate Structured JSON from Language Models
    ⭐ 3,598

  112. yizhongw/self-instruct
    Aligning pretrained language models with instruction data generated by themselves.
    ⭐ 3,560

  113. mmabrouk/llm-workflow-engine
    Power CLI and Workflow manager for LLMs (core package)
    ⭐ 3,549

  114. unslothai/unsloth
    5X faster 60% less memory QLoRA finetuning
    🔗 unsloth.ai
    ⭐ 3,471

  115. whitead/paper-qa
    LLM Chain for answering questions from documents with citations
    ⭐ 3,471

  116. luodian/Otter
    🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
    🔗 otter-ntu.github.io
    ⭐ 3,363

  117. minimaxir/gpt-2-simple
    Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
    ⭐ 3,360

  118. h2oai/h2o-llmstudio
    H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
    🔗 gpt-gm.h2o.ai
    ⭐ 3,351

  119. minimaxir/simpleaichat
    Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
    ⭐ 3,281

  120. deep-diver/LLM-As-Chatbot
    LLM as a Chatbot Service
    ⭐ 3,193

  121. eth-sri/lmql
    A language for constraint-guided and efficient LLM programming.
    🔗 lmql.ai
    ⭐ 3,061

  122. nvidia/NeMo-Guardrails
    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
    ⭐ 3,026

  123. allenai/OLMo
    OLMo is a repository for training and using AI2's state-of-the-art open language models. It is built by scientists, for scientists.
    🔗 allenai.org/olmo
    ⭐ 2,996

  124. microsoft/LMOps
    General technology for enabling AI capabilities w/ LLMs and MLLMs
    🔗 aka.ms/generalai
    ⭐ 2,954

  125. guardrails-ai/guardrails
    Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
    🔗 docs.guardrailsai.com
    ⭐ 2,917

  126. baichuan-inc/Baichuan-13B
    A 13B large language model developed by Baichuan Intelligent Technology
    🔗 huggingface.co/baichuan-inc/baichuan-13b-chat
    ⭐ 2,894

  127. iryna-kondr/scikit-llm
    Seamlessly integrate LLMs into scikit-learn.
    🔗 beastbyte.ai
    ⭐ 2,870

  128. microsoft/LLMLingua
    Speeds up LLM inference and improves the model's perception of key information by compressing the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
    ๐Ÿ”— llmlingua.com
    โญ 2,858

  129. microsoft/torchscale
    Foundation Architecture for (M)LLMs
    ๐Ÿ”— aka.ms/generalai
    โญ 2,825

  130. freedomintelligence/LLMZoo
    ⚡ LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models. ⚡
    โญ 2,808

  131. mshumer/gpt-llm-trainer
    Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you
    โญ 2,800

  132. juncongmoo/pyllama
    LLaMA: Open and Efficient Foundation Language Models
    โญ 2,760

  133. next-gpt/NExT-GPT
    Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
    ๐Ÿ”— next-gpt.github.io
    โญ 2,669

  134. paperswithcode/galai
    Model API for GALACTICA
    โญ 2,625

  135. llmware-ai/llmware
    Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
    ๐Ÿ”— pypi.org/project/llmware
    โญ 2,598

  136. li-plus/chatglm.cpp
    C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
    โญ 2,353

  137. defog-ai/sqlcoder
    SoTA LLM for converting natural language questions to SQL queries
    โญ 2,333

  138. phidatahq/phidata
    Phidata is a toolkit for building AI Assistants using function calling.
    ๐Ÿ”— docs.phidata.com
    โญ 2,312

  139. cg123/mergekit
    Tools for merging pretrained large language models.
    โญ 2,286

  140. ofa-sys/OFA
    Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
    โญ 2,259

  141. civitai/sd_civitai_extension
    All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI
    โญ 2,211

  142. alpha-vllm/LLaMA2-Accessory
    An Open-source Toolkit for LLM Development
    ๐Ÿ”— llama2-accessory.readthedocs.io
    โญ 2,183

  143. hegelai/prompttools
    Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
    ๐Ÿ”— prompttools.readthedocs.io
    โญ 2,165

  144. young-geng/EasyLM
    Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
    โญ 2,129

  145. openai/finetune-transformer-lm
    Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
    ๐Ÿ”— s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
    โญ 2,028

  146. openai/image-gpt
    Archived. Code and models from the paper "Generative Pretraining from Pixels"
    โญ 1,990

  147. open-compass/opencompass
    OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
    ๐Ÿ”— opencompass.org.cn
    โญ 1,945

  148. openai/gpt-2-output-dataset
    Dataset of GPT-2 outputs for research in detection, biases, and more
    โญ 1,861

  149. pathwaycom/llm-app
    LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker, ⚡ in sync with your data sources.
    ๐Ÿ”— pathway.com/developers/showcases/llm-app-pathway
    โญ 1,847

  150. lucidrains/toolformer-pytorch
    Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
    โญ 1,840

  151. minimaxir/aitextgen
    A robust Python tool for text-based AI training and generation using GPT-2.
    ๐Ÿ”— docs.aitextgen.io
    โญ 1,825

  152. tairov/llama2.mojo
    Inference Llama 2 in one file of pure 🔥
    ๐Ÿ”— www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov
    โญ 1,818

  153. intel/neural-compressor
    SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
    ๐Ÿ”— intel.github.io/neural-compressor
    โญ 1,803

  154. neulab/prompt2model
    prompt2model - Generate Deployable Models from Natural Language Instructions
    โญ 1,785

  155. noahshinn/reflexion
    [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
    โญ 1,747

  156. weaviate/Verba
    Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
    โญ 1,742

  157. spcl/graph-of-thoughts
    Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
    ๐Ÿ”— arxiv.org/pdf/2308.09687.pdf
    โญ 1,729

  158. cheshire-cat-ai/core
    Production ready AI assistant framework
    ๐Ÿ”— cheshirecat.ai
    โญ 1,724

  159. openai/gpt-discord-bot
    Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages.
    โญ 1,656

  160. huggingface/text-embeddings-inference
    A blazing fast inference solution for text embeddings models
    ๐Ÿ”— huggingface.co/docs/text-embeddings-inference/quick_tour
    โญ 1,647

  161. bclavie/RAGatouille
    Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices.
    โญ 1,634

  162. ist-daslab/gptq
    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
    ๐Ÿ”— arxiv.org/abs/2210.17323
    โญ 1,587

  163. microsoft/Megatron-DeepSpeed
    Ongoing research training transformer language models at scale, including: BERT & GPT-2
    โญ 1,432

  164. jina-ai/thinkgpt
    Agent techniques to augment your LLM and push it beyond its limits
    โญ 1,432

  165. epfllm/meditron
    Meditron is a suite of open-source medical Large Language Models (LLMs).
    ๐Ÿ”— huggingface.co/epfl-llm
    โญ 1,414

  166. cstankonrad/long_llama
    LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
    โญ 1,390

  167. langchain-ai/langgraph
    LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
    โญ 1,367

  168. farizrahman4u/loopgpt
    Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind.
    โญ 1,359

  169. explosion/spacy-transformers
    🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
    ๐Ÿ”— spacy.io/usage/embeddings-transformers
    โญ 1,309

  170. ray-project/llm-applications
    A comprehensive guide to building RAG-based LLM applications for production.
    โญ 1,308

  171. truera/trulens
    Evaluation and Tracking for LLM Experiments
    ๐Ÿ”— www.trulens.org
    โญ 1,246

  172. run-llama/llama-lab
    Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex
    โญ 1,216

  173. bigscience-workshop/Megatron-DeepSpeed
    Ongoing research training transformer language models at scale, including: BERT & GPT-2
    โญ 1,168

  174. chatarena/chatarena
    ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
    ๐Ÿ”— www.chatarena.org
    โญ 1,167

  175. srush/MiniChain
    A tiny library for coding with large language models.
    ๐Ÿ”— srush-minichain.hf.space
    โญ 1,162

  176. akariasai/self-rag
    This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
    ๐Ÿ”— selfrag.github.io
    โญ 1,116

  177. ibm/Dromedary
    Dromedary: towards helpful, ethical and reliable LLMs.
    โญ 1,055

  178. nomic-ai/pygpt4all
    Officially supported Python bindings for llama.cpp + gpt4all
    ๐Ÿ”— nomic-ai.github.io/pygpt4all
    โญ 1,022

  179. ray-project/ray-llm
    RayLLM - LLMs on Ray
    ๐Ÿ”— aviary.anyscale.com
    โญ 1,009

  180. rlancemartin/auto-evaluator
    Evaluation tool for LLM QA chains
    ๐Ÿ”— autoevaluator.langchain.com
    โญ 993

  181. lupantech/chameleon-llm
    Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
    ๐Ÿ”— chameleon-llm.github.io
    โญ 985

  182. ctlllll/LLM-ToolMaker
    Large Language Models as Tool Makers
    โญ 973

  183. microsoft/Llama-2-Onnx
    A Microsoft optimized version of the Llama 2 model, available from Meta
    โญ 947

  184. linksoul-ai/AutoAgents
    Generate different roles for GPTs to form a collaborative entity for complex tasks.
    ๐Ÿ”— huggingface.co/spaces/linksoul/autoagents
    โญ 936

  185. ajndkr/lanarky
    The web framework for building LLM microservices
    ๐Ÿ”— lanarky.ajndkr.com
    โญ 917

  186. predibase/lorax
    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
    ๐Ÿ”— predibase.github.io/lorax
    โญ 902

  187. keirp/automatic_prompt_engineer
    Large Language Models Are Human-Level Prompt Engineers
    โญ 901

  188. hao-ai-lab/LookaheadDecoding
    Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
    โญ 865

  189. explosion/spacy-llm
    🦙 Integrating LLMs into structured NLP pipelines
    ๐Ÿ”— spacy.io/usage/large-language-models
    โญ 845

  190. cerebras/modelzoo
    Examples of common deep learning models that can be trained on Cerebras hardware
    โญ 802

  191. muennighoff/sgpt
    SGPT: GPT Sentence Embeddings for Semantic Search
    ๐Ÿ”— arxiv.org/abs/2202.08904
    โญ 776

  192. meetkai/functionary
    Chat language model that can interpret and execute functions/plugins
    โญ 765

  193. oliveirabruno01/babyagi-asi
    BabyAGI: an Autonomous and Self-Improving agent, or BASI
    โญ 741

  194. opengenerativeai/GenossGPT
    One API for all LLMs, either private or public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈 Replace OpenAI GPT with any LLM in your app with one line.
    ๐Ÿ”— genoss.ai
    โญ 710

  195. agenta-ai/agenta
    The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
    ๐Ÿ”— www.agenta.ai
    โญ 705

  196. salesforce/xgen
    Salesforce open-source LLMs with 8k sequence length.
    โญ 702

  197. huggingface/nanotron
    Minimalistic large language model 3D-parallelism training
    โญ 579

  198. hazyresearch/ama_prompting
    Ask Me Anything language model prompting
    โญ 524

  199. squeezeailab/SqueezeLLM
    SqueezeLLM: Dense-and-Sparse Quantization
    ๐Ÿ”— arxiv.org/abs/2306.07629
    โญ 516

  200. zhudotexe/kani
    kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
    ๐Ÿ”— kani.readthedocs.io
    โญ 515

  201. lupantech/ScienceQA
    Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
    โญ 510

  202. continuum-llms/chatgpt-memory
    Scales the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory, powered by GPT and a Redis datastore.
    โญ 510

Math and Science

Maths and science libraries.

  1. numpy/numpy
    The fundamental package for scientific computing with Python.
    ๐Ÿ”— numpy.org
    โญ 25,749

  2. scipy/scipy
    SciPy library main repository
    ๐Ÿ”— scipy.org
    โญ 12,203

  3. sympy/sympy
    A computer algebra system written in pure Python
    ๐Ÿ”— sympy.org
    โญ 11,856

  4. cupy/cupy
    NumPy & SciPy for GPU
    ๐Ÿ”— cupy.dev
    โญ 7,562

  5. google-deepmind/alphageometry
    Solving Olympiad Geometry without Human Demonstrations
    โญ 3,353

  6. mckinsey/causalnex
    A Python library that helps data scientists to infer causation rather than observing correlation.
    ๐Ÿ”— causalnex.readthedocs.io
    โญ 2,100

  7. pyomo/pyomo
    An object-oriented algebraic modeling language in Python for structured optimization problems.
    ๐Ÿ”— www.pyomo.org
    โญ 1,785

  8. facebookresearch/theseus
    A library for differentiable nonlinear optimization
    โญ 1,563

  9. google-research/torchsde
    Differentiable SDE solvers with GPU support and efficient sensitivity analysis.
    โญ 1,424

  10. dynamicslab/pysindy
    A package for the sparse identification of nonlinear dynamical systems from data
    ๐Ÿ”— pysindy.readthedocs.io/en/latest
    โญ 1,214

  11. geomstats/geomstats
    Computations and statistics on manifolds with geometric structures.
    ๐Ÿ”— geomstats.ai
    โญ 1,122

  12. cma-es/pycma
    pycma is a Python implementation of CMA-ES and a few related numerical optimization tools.
    โญ 995

  13. sj001/AI-Feynman
    Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression
    โญ 569

  14. willianfuks/tfcausalimpact
    Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.
    โญ 550

Machine Learning - General

General and classical machine learning libraries. See below for other sections covering specialised ML areas.

  1. scikit-learn/scikit-learn
    scikit-learn: machine learning in Python
    ๐Ÿ”— scikit-learn.org
    โญ 57,432

  2. openai/openai-cookbook
    Examples and guides for using the OpenAI API
    ๐Ÿ”— cookbook.openai.com
    โญ 54,014

  3. tencentarc/GFPGAN
    GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
    โญ 34,028

  4. google-research/google-research
    This repository contains code released by Google Research
    ๐Ÿ”— research.google
    โญ 32,181

  5. suno-ai/bark
    🔊 Text-Prompted Generative Audio Model
    โญ 30,568

  6. open-mmlab/mmdetection
    OpenMMLab Detection Toolbox and Benchmark
    ๐Ÿ”— mmdetection.readthedocs.io
    โญ 27,046

  7. facebookresearch/faiss
    A library for efficient similarity search and clustering of dense vectors.
    ๐Ÿ”— faiss.ai
    โญ 26,893

  8. google/jax
    Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
    ๐Ÿ”— jax.readthedocs.io
    โญ 26,672

  9. ageron/handson-ml2
    A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
    โญ 26,518

  10. lutzroeder/netron
    Visualizer for neural network, deep learning and machine learning models
    ๐Ÿ”— netron.app
    โญ 25,464

  11. dmlc/xgboost
    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
    ๐Ÿ”— xgboost.readthedocs.io/en/stable
    โญ 25,319

  12. google/mediapipe
    Cross-platform, customizable ML solutions for live and streaming media.
    ๐Ÿ”— mediapipe.dev
    โญ 24,835

  13. harisiqbal88/PlotNeuralNet
    LaTeX code for making neural network diagrams
    โญ 20,696

  14. jina-ai/jina
    โ˜๏ธ Build multimodal AI applications with cloud-native stack
    ๐Ÿ”— docs.jina.ai
    โญ 19,687

  15. onnx/onnx
    Open standard for machine learning interoperability
    ๐Ÿ”— onnx.ai
    โญ 16,425

  16. microsoft/LightGBM
    A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
    ๐Ÿ”— lightgbm.readthedocs.io/en/latest
    โญ 15,865

  17. tensorflow/tensor2tensor
    Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
    โญ 14,568

  18. ddbourgin/numpy-ml
    Machine learning, in numpy
    ๐Ÿ”— numpy-ml.readthedocs.io
    โญ 14,435

  19. aleju/imgaug
    Image augmentation for machine learning experiments.
    ๐Ÿ”— imgaug.readthedocs.io
    โญ 14,026

  20. microsoft/nni
    An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
    ๐Ÿ”— nni.readthedocs.io
    โญ 13,593

  21. ml-explore/mlx
    MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
    โญ 12,650

  22. jindongwang/transferlearning
    Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials. - 迁移学习 (transfer learning)
    ๐Ÿ”— transferlearning.xyz
    โญ 12,594

  23. deepmind/deepmind-research
    This repository contains implementations and illustrative code to accompany DeepMind publications
    โญ 12,517

  24. spotify/annoy
    Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
    โญ 12,464

  25. microsoft/Swin-Transformer
    This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
    ๐Ÿ”— arxiv.org/abs/2103.14030
    โญ 12,461

  26. microsoft/onnxruntime
    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
    ๐Ÿ”— onnxruntime.ai
    โญ 12,000

  27. deepmind/alphafold
    Implementation of the inference pipeline of AlphaFold v2
    โญ 11,388

  28. neonbjb/tortoise-tts
    A multi-voice TTS system trained with an emphasis on quality
    โญ 11,060

  29. facebookresearch/AnimatedDrawings
    Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
    โญ 9,984

  30. twitter/the-algorithm-ml
    Source code for Twitter's Recommendation Algorithm
    ๐Ÿ”— blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm
    โญ 9,826

  31. roboflow/supervision
    We write your reusable computer vision tools. 💜
    ๐Ÿ”— supervision.roboflow.com
    โญ 9,626

  32. epistasislab/tpot
    A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
    ๐Ÿ”— epistasislab.github.io/tpot
    โญ 9,419

  33. statsmodels/statsmodels
    Statsmodels: statistical modeling and econometrics in Python
    ๐Ÿ”— www.statsmodels.org/devel
    โญ 9,315

  34. optuna/optuna
    A hyperparameter optimization framework
    ๐Ÿ”— optuna.org
    โญ 9,296

  35. ggerganov/ggml
    Tensor library for machine learning
    โญ 8,928

  36. megvii-basedetection/YOLOX
    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
    โญ 8,858

  37. pycaret/pycaret
    An open-source, low-code machine learning library in Python
    ๐Ÿ”— www.pycaret.org
    โญ 8,215

  38. pymc-devs/pymc
    Bayesian Modeling and Probabilistic Programming in Python
    ๐Ÿ”— docs.pymc.io
    โญ 8,030

  39. cleanlab/cleanlab
    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
    ๐Ÿ”— cleanlab.ai
    โญ 7,926

  40. uberi/speech_recognition
    Speech recognition module for Python, supporting several engines and APIs, online and offline.
    ๐Ÿ”— pypi.python.org/pypi/speechrecognition
    โญ 7,890

  41. wandb/wandb
    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
    ๐Ÿ”— wandb.ai
    โญ 7,852

  42. catboost/catboost
    A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
    ๐Ÿ”— catboost.ai
    โญ 7,605

  43. automl/auto-sklearn
    Automated Machine Learning with scikit-learn
    ๐Ÿ”— automl.github.io/auto-sklearn
    โญ 7,331

  44. open-mmlab/mmsegmentation
    OpenMMLab Semantic Segmentation Toolbox and Benchmark.
    ๐Ÿ”— mmsegmentation.readthedocs.io/en/latest
    โญ 7,073

  45. facebookresearch/xformers
    Hackable and optimized Transformers building blocks, supporting a composable construction.
    ๐Ÿ”— facebookresearch.github.io/xformers
    โญ 7,058

  46. hyperopt/hyperopt
    Distributed Asynchronous Hyperparameter Optimization in Python
    ๐Ÿ”— hyperopt.github.io/hyperopt
    โญ 7,006

  47. featurelabs/featuretools
    An open source python library for automated feature engineering
    ๐Ÿ”— www.featuretools.com
    โญ 6,961

  48. awslabs/autogluon
    AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
    ๐Ÿ”— auto.gluon.ai
    โญ 6,839

  49. lmcinnes/umap
    Uniform Manifold Approximation and Projection
    โญ 6,786

  50. hips/autograd
    Efficiently computes derivatives of numpy code.
    โญ 6,688

  51. scikit-learn-contrib/imbalanced-learn
    A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
    ๐Ÿ”— imbalanced-learn.org
    โญ 6,643

  52. py-why/dowhy
    DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
    ๐Ÿ”— www.pywhy.org/dowhy
    โญ 6,588

  53. huggingface/accelerate
    🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
    ๐Ÿ”— huggingface.co/docs/accelerate
    โญ 6,530

  54. open-mmlab/mmagic
    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
    ๐Ÿ”— mmagic.readthedocs.io/en/latest
    โญ 6,366

  55. probml/pyprobml
    Python code for "Probabilistic Machine learning" book by Kevin Murphy
    โญ 6,149

  56. nicolashug/Surprise
    A Python scikit for building and analyzing recommender systems
    ๐Ÿ”— surpriselib.com
    โญ 6,126

  57. google/automl
    Google Brain AutoML
    โญ 6,111

  58. cleverhans-lab/cleverhans
    An adversarial example library for constructing attacks, building defenses, and benchmarking both
    โญ 6,025

  59. kevinmusgrave/pytorch-metric-learning
    The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
    ๐Ÿ”— kevinmusgrave.github.io/pytorch-metric-learning
    โญ 5,658

  60. open-mmlab/mmcv
    OpenMMLab Computer Vision Foundation
    ๐Ÿ”— mmcv.readthedocs.io/en/latest
    โญ 5,466

  61. project-monai/MONAI
    AI Toolkit for Healthcare Imaging
    ๐Ÿ”— monai.io
    โญ 5,057

  62. mdbloice/Augmentor
    Image augmentation library in Python for machine learning.
    ๐Ÿ”— augmentor.readthedocs.io/en/stable
    โญ 5,005

  63. rasbt/mlxtend
    A library of extension and helper modules for Python's data analysis and machine learning libraries.
    ๐Ÿ”— rasbt.github.io/mlxtend
    โญ 4,721

  64. online-ml/river
    🌊 Online machine learning in Python
    ๐Ÿ”— riverml.xyz
    โญ 4,638

  65. uber/causalml
    Uplift modeling and causal inference with machine learning algorithms
    โญ 4,589

  66. lucidrains/deep-daze
    Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
    โญ 4,381

  67. districtdatalabs/yellowbrick
    Visual analysis and diagnostic tools to facilitate machine learning model selection.
    ๐Ÿ”— www.scikit-yb.org
    โญ 4,169

  68. ml-explore/mlx-examples
    Examples in the MLX framework
    โญ 4,099

  69. nv-tlabs/GET3D
    Generative Model of High Quality 3D Textured Shapes Learned from Images
    โญ 4,033

  70. skvark/opencv-python
    Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
    ๐Ÿ”— pypi.org/project/opencv-python
    โญ 4,003

  71. marqo-ai/marqo
    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
    ๐Ÿ”— www.marqo.ai
    โญ 3,958

  72. apple/coremltools
    Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
    ๐Ÿ”— coremltools.readme.io
    โญ 3,939

  73. google-deepmind/graphcast
    GraphCast: Learning skillful medium-range global weather forecasting
    โญ 3,930

  74. sanchit-gandhi/whisper-jax
    JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
    โญ 3,903

  75. nmslib/hnswlib
    Header-only C++/python library for fast approximate nearest neighbors
    ๐Ÿ”— github.com/nmslib/hnswlib
    โญ 3,849

  76. cmusphinx/pocketsphinx
    A small speech recognizer
    โญ 3,670

  77. ourownstory/neural_prophet
    NeuralProphet: A simple forecasting package
    ๐Ÿ”— neuralprophet.com
    โญ 3,555

  78. microsoft/FLAML
    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
    ๐Ÿ”— microsoft.github.io/flaml
    โญ 3,549

  79. py-why/EconML
    ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
    ๐Ÿ”— www.microsoft.com/en-us/research/project/alice
    โญ 3,449

  80. thudm/CogVideo
    Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
    โญ 3,397

  81. facebookresearch/vissl
    VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
    ๐Ÿ”— vissl.ai
    โญ 3,199

  82. huggingface/notebooks
    Notebooks using the Hugging Face libraries 🤗
    โญ 3,112

  83. pytorch/glow
    Compiler for Neural Network hardware accelerators
    โญ 3,101

  84. rucaibox/RecBole
    A unified, comprehensive and efficient recommendation library
    ๐Ÿ”— recbole.io
    โญ 3,076

  85. hrnet/HRNet-Semantic-Segmentation
    The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
    โญ 3,018

  86. yoheinakajima/instagraph
    Converts text input or a URL into a knowledge graph and displays it
    โญ 3,017

  87. lucidrains/musiclm-pytorch
    Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
    โญ 2,918

  88. mljar/mljar-supervised
    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
    ๐Ÿ”— mljar.com
    โญ 2,887

  89. huggingface/autotrain-advanced
    AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
    ๐Ÿ”— huggingface.co/autotrain
    โญ 2,798

  90. zjunlp/DeepKE
    An Open Toolkit for Knowledge Graph Extraction and Construction published at EMNLP2022 System Demonstrations.
    ๐Ÿ”— deepke.zjukg.cn
    โญ 2,718

  91. teamhg-memex/eli5
    A library for debugging/inspecting machine learning classifiers and explaining their predictions
    ๐Ÿ”— eli5.readthedocs.io
    โญ 2,718

  92. scikit-optimize/scikit-optimize
    Sequential model-based optimization with a scipy.optimize interface
    ๐Ÿ”— scikit-optimize.github.io
    โญ 2,710

  93. lightly-ai/lightly
    A python library for self-supervised learning on images.
    ๐Ÿ”— docs.lightly.ai/self-supervised-learning
    โญ 2,688

  94. scikit-learn-contrib/hdbscan
    A high performance implementation of HDBSCAN clustering.
    ๐Ÿ”— hdbscan.readthedocs.io/en/latest
    โญ 2,629

  95. maif/shapash
    🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
    ๐Ÿ”— maif.github.io/shapash
    โญ 2,607

  96. shankarpandala/lazypredict
    Lazy Predict helps build many basic models without much code and helps you understand which models work better without any parameter tuning
    โญ 2,506

  97. apple/ml-ane-transformers
    Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
    โญ 2,434

  98. scikit-learn-contrib/category_encoders
    A library of sklearn compatible categorical variable encoders
    ๐Ÿ”— contrib.scikit-learn.org/category_encoders
    โญ 2,345

  99. google-research/t5x
    T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales.
    โญ 2,338

  100. freedmand/semantra
    Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text.
    โญ 2,174

  101. aws/sagemaker-python-sdk
    A library for training and deploying machine learning models on Amazon SageMaker
    ๐Ÿ”— sagemaker.readthedocs.io
    โญ 2,010

  102. rom1504/clip-retrieval
    Easily compute clip embeddings and build a clip retrieval system with them
    ๐Ÿ”— rom1504.github.io/clip-retrieval
    โญ 1,983

  103. huggingface/optimum
    🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
    ๐Ÿ”— huggingface.co/docs/optimum/main
    โญ 1,975

  104. rentruewang/koila
    Prevent PyTorch's CUDA error: out of memory in just 1 line of code.
    ๐Ÿ”— rentruewang.github.io/koila
    โญ 1,809

  105. contextlab/hypertools
    A Python toolbox for gaining geometric insights into high-dimensional data
    ๐Ÿ”— hypertools.readthedocs.io/en/latest
    โญ 1,794

  106. linkedin/greykite
    A flexible, intuitive and fast forecasting library
    โญ 1,773

  107. bmabey/pyLDAvis
    Python library for interactive topic model visualization. Port of the R LDAvis package.
    โญ 1,766

  108. huggingface/evaluate
    🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
    ๐Ÿ”— huggingface.co/docs/evaluate
    โญ 1,719

  109. scikit-learn-contrib/lightning
    Large-scale linear classification, regression and ranking in Python
    ๐Ÿ”— contrib.scikit-learn.org/lightning
    โญ 1,699

  110. tensorflow/addons
    Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
    โญ 1,680

  111. microsoft/i-Code
    The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning.
    โญ 1,578

  112. huggingface/huggingface_hub
    The official Python client for the Hugging Face Hub.
    ๐Ÿ”— huggingface.co/docs/huggingface_hub
    โญ 1,511

  113. kubeflow/katib
    Repository for hyperparameter tuning
    โญ 1,400

  114. jina-ai/finetuner
    ๐ŸŽฏ Task-oriented embedding tuning for BERT, CLIP, etc.
    ๐Ÿ”— finetuner.jina.ai
    โญ 1,395

  115. scikit-learn-contrib/metric-learn
    Metric learning algorithms in Python
    ๐Ÿ”— contrib.scikit-learn.org/metric-learn
    โญ 1,362

  116. visual-layer/fastdup
    fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
    โญ 1,346

  117. castorini/pyserini
    Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
    ๐Ÿ”— pyserini.io
    โญ 1,332

  118. eric-mitchell/direct-preference-optimization
    Reference implementation for DPO (Direct Preference Optimization)
    โญ 1,332

  119. laekov/fastmoe
    A fast MoE impl for PyTorch
    ๐Ÿ”— fastmoe.ai
    โญ 1,287

  120. csinva/imodels
    Interpretable ML package ๐Ÿ” for concise, transparent, and accurate predictive modeling (sklearn-compatible).
    ๐Ÿ”— csinva.io/imodels
    โญ 1,257

  121. borealisai/advertorch
    A Toolbox for Adversarial Robustness Research
    ⭐ 1,254

  122. awslabs/dgl-ke
    High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
    🔗 dglke.dgl.ai/doc
    ⭐ 1,214

  123. googlecloudplatform/vertex-ai-samples
    Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
    🔗 cloud.google.com/vertex-ai
    ⭐ 1,213

  124. google/vizier
    Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
    🔗 oss-vizier.readthedocs.io
    ⭐ 1,144

  125. microsoft/Olive
    Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
    🔗 microsoft.github.io/olive
    ⭐ 1,116

  126. koaning/scikit-lego
    Extra blocks for scikit-learn pipelines.
    🔗 koaning.github.io/scikit-lego
    ⭐ 1,108

  127. spotify/voyager
    🛰️ Voyager is an approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
    🔗 spotify.github.io/voyager
    ⭐ 1,092

  128. microsoft/Semi-supervised-learning
    A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
    🔗 usb.readthedocs.io
    ⭐ 1,091

  129. automl/TabPFN
    Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
    🔗 priorlabs.ai
    ⭐ 1,052

  130. patchy631/machine-learning
    Machine Learning Tutorials Repository
    ⭐ 977

  131. google-research/deeplab2
    DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.
    ⭐ 973

  132. dask/dask-ml
    Scalable Machine Learning with Dask
    🔗 ml.dask.org
    ⭐ 877

  133. lmcinnes/pynndescent
    A Python nearest neighbor descent for approximate nearest neighbors
    ⭐ 826

  134. hazyresearch/safari
    Convolutions for Sequence Modeling
    ⭐ 817

  135. davidmrau/mixture-of-experts
    PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
    ⭐ 752

  136. oml-team/open-metric-learning
    OML is a PyTorch-based framework to train and validate models that produce high-quality embeddings.
    🔗 open-metric-learning.readthedocs.io/en/latest/index.html
    ⭐ 737

  137. criteo/autofaiss
    Automatically create Faiss knn indices with optimal similarity search parameters.
    🔗 criteo.github.io/autofaiss
    ⭐ 709

  138. nvidia/cuda-python
    CUDA Python Low-level Bindings
    🔗 nvidia.github.io/cuda-python
    ⭐ 704

  139. facebookresearch/balance
    The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
    🔗 import-balance.org
    ⭐ 663

  140. opentensor/bittensor
    Internet-scale Neural Networks
    🔗 www.bittensor.com
    ⭐ 657

  141. hpcaitech/EnergonAI
    Large-scale model inference.
    ⭐ 630

  142. awslabs/python-deequ
    Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets
    ⭐ 625

  143. qdrant/fastembed
    Fast, Accurate, Lightweight Python library to make State of the Art Embedding
    🔗 qdrant.github.io/fastembed
    ⭐ 604

  144. qdrant/quaterion
    Blazing fast framework for fine-tuning similarity learning models
    🔗 quaterion.qdrant.tech
    ⭐ 603

  145. nicolas-hbt/pygraft
    Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
    🔗 pygraft.readthedocs.io/en/latest
    ⭐ 559

  146. replicate/replicate-python
    Python client for Replicate
    🔗 replicate.com
    ⭐ 546

  147. microsoft/Focal-Transformer
    [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
    ⭐ 537

  148. nevronai/MetisFL
    The first open Federated Learning framework implemented in C++ and Python.
    🔗 metisfl.org
    ⭐ 528

  149. cvxgrp/pymde
    Minimum-distortion embedding with PyTorch
    🔗 pymde.org
    ⭐ 507

  150. huggingface/exporters
    Export Hugging Face models to Core ML and TensorFlow Lite
    ⭐ 501

Machine Learning - Deep Learning

Machine learning libraries that cross over with deep learning in some way.

  1. tensorflow/tensorflow
    An Open Source Machine Learning Framework for Everyone
    🔗 tensorflow.org
    ⭐ 180,771

  2. pytorch/pytorch
    Tensors and Dynamic neural networks in Python with strong GPU acceleration
    🔗 pytorch.org
    ⭐ 75,531

  3. keras-team/keras
    Deep Learning for humans
    🔗 keras.io
    ⭐ 60,309

  4. openai/whisper
    Robust Speech Recognition via Large-Scale Weak Supervision
    ⭐ 56,198

  5. deepfakes/faceswap
    Deepfakes Software For All
    🔗 www.faceswap.dev
    ⭐ 48,528

  6. iperov/DeepFaceLab
    DeepFaceLab is the leading software for creating deepfakes.
    ⭐ 44,567

  7. facebookresearch/segment-anything
    The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
    ⭐ 42,393

  8. microsoft/DeepSpeed
    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
    🔗 www.deepspeed.ai
    ⭐ 31,420

  9. rwightman/pytorch-image-models
    PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
    🔗 huggingface.co/docs/timm
    ⭐ 28,754

  10. facebookresearch/detectron2
    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
    🔗 detectron2.readthedocs.io/en/latest
    ⭐ 28,081

  11. pytorchlightning/pytorch-lightning
    Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
    🔗 lightning.ai
    ⭐ 26,251

  12. facebookresearch/Detectron
    FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
    ⭐ 26,090

  13. xinntao/Real-ESRGAN
    Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
    ⭐ 25,134

  14. matterport/Mask_RCNN
    Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
    ⭐ 23,927

  15. paddlepaddle/Paddle
    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
    🔗 www.paddlepaddle.org
    ⭐ 21,365

  16. openai/CLIP
    CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
    ⭐ 20,736

  17. apache/mxnet
    Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
    🔗 mxnet.apache.org
    ⭐ 20,687

  18. pyg-team/pytorch_geometric
    Graph Neural Network Library for PyTorch
    🔗 pyg.org
    ⭐ 19,597

  19. lucidrains/vit-pytorch
    Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
    ⭐ 16,995

  20. rasbt/deeplearning-models
    A collection of various deep learning architectures, models, and tips
    ⭐ 16,197

  21. sanster/IOPaint
    Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
    🔗 www.iopaint.com
    ⭐ 15,878

  22. danielgatis/rembg
    Rembg is a tool to remove image backgrounds
    ⭐ 13,238

  23. albumentations-team/albumentations
    Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
    🔗 albumentations.ai
    ⭐ 13,128

  24. dmlc/dgl
    Python package built to ease deep learning on graphs, on top of existing DL frameworks.
    🔗 dgl.ai
    ⭐ 12,770

  25. facebookresearch/detr
    End-to-End Object Detection with Transformers
    ⭐ 12,497

  26. nvidia/DeepLearningExamples
    State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
    ⭐ 12,254

  27. kornia/kornia
    Geometric Computer Vision Library for Spatial AI
    🔗 kornia.readthedocs.io
    ⭐ 9,115

  28. keras-team/autokeras
    AutoML library for deep learning
    🔗 autokeras.com
    ⭐ 9,042

  29. pyro-ppl/pyro
    Deep universal probabilistic programming with Python and PyTorch
    🔗 pyro.ai
    ⭐ 8,272

  30. facebookresearch/pytorch3d
    PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
    🔗 pytorch3d.org
    ⭐ 8,088

  31. google/trax
    Trax - Deep Learning with Clear Code and Speed
    ⭐ 7,886

  32. nvidia/apex
    A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
    ⭐ 7,862

  33. arogozhnikov/einops
    Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
    🔗 einops.rocks
    ⭐ 7,674

  34. facebookresearch/ImageBind
    ImageBind: One Embedding Space to Bind Them All
    ⭐ 7,654

  35. lucidrains/imagen-pytorch
    Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
    ⭐ 7,643

  36. xpixelgroup/BasicSR
    Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.
    🔗 basicsr.readthedocs.io/en/latest
    ⭐ 5,915

  37. skorch-dev/skorch
    A scikit-learn compatible neural network library that wraps PyTorch
    ⭐ 5,569

  38. facebookresearch/mmf
    A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
    🔗 mmf.sh
    ⭐ 5,377

  39. google/flax
    Flax is a neural network library for JAX that is designed for flexibility.
    🔗 flax.readthedocs.io
    ⭐ 5,211

  40. mosaicml/composer
    Supercharge Your Model Training
    🔗 docs.mosaicml.com
    ⭐ 4,833

  41. pytorch/ignite
    High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
    🔗 pytorch-ignite.ai
    ⭐ 4,422

  42. facebookincubator/AITemplate
    AITemplate is a Python framework which renders neural networks into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
    ⭐ 4,391

  43. deci-ai/super-gradients
    Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
    🔗 www.supergradients.com
    ⭐ 4,193

  44. williamyang1991/VToonify
    [SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
    ⭐ 3,432

  45. facebookresearch/PyTorch-BigGraph
    Generate embeddings from large-scale graph-structured data.
    🔗 torchbiggraph.readthedocs.io
    ⭐ 3,339

  46. alpa-projects/alpa
    Training and serving large-scale neural networks with auto parallelization.
    🔗 alpa.ai
    ⭐ 2,947

  47. pytorch/botorch
    Bayesian optimization in PyTorch
    🔗 botorch.org
    ⭐ 2,898

  48. cvg/LightGlue
    LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
    ⭐ 2,779

  49. explosion/thinc
    🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
    🔗 thinc.ai
    ⭐ 2,776

  50. deepmind/dm-haiku
    JAX-based neural network library
    🔗 dm-haiku.readthedocs.io
    ⭐ 2,697

  51. nerdyrodent/VQGAN-CLIP
    Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
    ⭐ 2,547

  52. danielegrattarola/spektral
    Graph Neural Networks with Keras and Tensorflow 2.
    🔗 graphneural.network
    ⭐ 2,324

  53. google-research/electra
    ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
    ⭐ 2,274

  54. neuralmagic/sparseml
    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
    ⭐ 1,924

  55. fepegar/torchio
    Medical imaging toolkit for deep learning
    🔗 www.torchio.org
    ⭐ 1,921

  56. pytorch/torchrec
    Pytorch domain library for recommendation systems
    ⭐ 1,658

  57. vt-vl-lab/FGVC
    [ECCV 2020] Flow-edge Guided Video Completion
    ⭐ 1,524

  58. tensorflow/mesh
    Mesh TensorFlow: Model Parallelism Made Easier
    ⭐ 1,503

  59. tensorly/tensorly
    TensorLy: Tensor Learning in Python.
    🔗 tensorly.org
    ⭐ 1,479

  60. calculatedcontent/WeightWatcher
    The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
    ⭐ 1,368

  61. hysts/pytorch_image_classification
    PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet
    ⭐ 1,293

  62. xl0/lovely-tensors
    Tensors, ready for human consumption
    🔗 xl0.github.io/lovely-tensors
    ⭐ 1,026

  63. tensorflow/similarity
    TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
    ⭐ 993

  64. jeshraghian/snntorch
    Deep and online learning with spiking neural networks in Python
    🔗 snntorch.readthedocs.io/en/latest
    ⭐ 956

  65. deepmind/android_env
    RL research on Android devices.
    ⭐ 939

  66. keras-team/keras-cv
    Industry-strength Computer Vision workflows with Keras
    ⭐ 903

  67. deepmind/chex
    Chex is a library of utilities for helping to write reliable JAX code
    🔗 chex.readthedocs.io
    ⭐ 684

  68. kakaobrain/rq-vae-transformer
    The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
    ⭐ 655

  69. whitead/dmol-book
    Deep learning for molecules and materials book
    🔗 dmol.pub
    ⭐ 561

Machine Learning - Interpretability

Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, and understanding how knowledge develops during training.

  1. slundberg/shap
    A game theoretic approach to explain the output of any machine learning model.
    🔗 shap.readthedocs.io
    ⭐ 21,186

  2. marcotcr/lime
    Lime: Explaining the predictions of any machine learning classifier
    ⭐ 11,145

  3. interpretml/interpret
    Fit interpretable models. Explain blackbox machine learning.
    🔗 interpret.ml/docs
    ⭐ 5,914

  4. tensorflow/lucid
    A collection of infrastructure and tools for research in neural network interpretability.
    ⭐ 4,600

  5. pytorch/captum
    Model interpretability and understanding for PyTorch
    🔗 captum.ai
    ⭐ 4,443

  6. pair-code/lit
    The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
    🔗 pair-code.github.io/lit
    ⭐ 3,289

  7. seldonio/alibi
    Algorithms for explaining machine learning models
    🔗 docs.seldon.io/projects/alibi/en/stable
    ⭐ 2,271

  8. oegedijk/explainerdashboard
    Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
    🔗 explainerdashboard.readthedocs.io
    ⭐ 2,147

  9. arize-ai/phoenix
    AI Observability & Evaluation - Evaluate, troubleshoot, and fine tune your LLM, CV, and NLP models in a notebook.
    🔗 docs.arize.com/phoenix
    ⭐ 2,134

  10. eleutherai/pythia
    Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers
    ⭐ 1,886

  11. jalammar/ecco
    Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).
    🔗 ecco.readthedocs.io
    ⭐ 1,868

  12. cdpierse/transformers-interpret
    Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
    ⭐ 1,180

  13. selfexplainml/PiML-Toolbox
    PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
    🔗 selfexplainml.github.io/piml-toolbox
    ⭐ 820

Machine Learning - Ops

MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.

  1. apache/airflow
    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
    🔗 airflow.apache.org
    ⭐ 33,587

  2. ray-project/ray
    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
    🔗 ray.io
    ⭐ 29,976

  3. spotify/luigi
    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
    ⭐ 17,132

  4. mlflow/mlflow
    Open source platform for the machine learning lifecycle
    🔗 mlflow.org
    ⭐ 16,793

  5. prefecthq/prefect
    Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
    🔗 prefect.io
    ⭐ 14,077

  6. horovod/horovod
    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
    🔗 horovod.ai
    ⭐ 13,812

  7. iterative/dvc
    🦉 ML Experiments Management with Git
    🔗 dvc.org
    ⭐ 12,890

  8. ludwig-ai/ludwig
    Low-code framework for building custom LLMs, neural networks, and other AI models
    🔗 ludwig.ai
    ⭐ 10,538

  9. dagster-io/dagster
    An orchestration platform for the development, production, and observation of data assets.
    🔗 dagster.io
    ⭐ 9,660

  10. great-expectations/great_expectations
    Always know what to expect from your data.
    🔗 docs.greatexpectations.io
    ⭐ 9,264

  11. kedro-org/kedro
    Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
    🔗 kedro.org
    ⭐ 9,206

  12. dbt-labs/dbt-core
    dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
    🔗 getdbt.com
    ⭐ 8,529

  13. bentoml/OpenLLM
    Operating LLMs in production
    🔗 bentoml.com
    ⭐ 8,175

  14. activeloopai/deeplake
    Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
    🔗 activeloop.ai
    ⭐ 7,495

  15. netflix/metaflow
    🚀 Build and manage real-life ML, AI, and data science projects with ease!
    🔗 metaflow.org
    ⭐ 7,384

  16. mage-ai/mage-ai
    🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
    🔗 www.mage.ai
    ⭐ 6,554

  17. bentoml/BentoML
    Build Production-Grade AI Applications
    🔗 bentoml.com
    ⭐ 6,310

  18. kestra-io/kestra
    Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
    🔗 kestra.io
    ⭐ 5,357

  19. feast-dev/feast
    Feature Store for Machine Learning
    🔗 feast.dev
    ⭐ 5,107

  20. allegroai/clearml
    ClearML - Auto-Magical CI/CD to streamline your ML workflow. Experiment Manager, MLOps and Data-Management
    🔗 clear.ml/docs
    ⭐ 5,076

  21. aimhubio/aim
    Aim 💫 - An easy-to-use & supercharged open-source experiment tracker.
    🔗 aimstack.io
    ⭐ 4,615

  22. flyteorg/flyte
    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
    🔗 flyte.org
    ⭐ 4,538

  23. evidentlyai/evidently
    Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
    ⭐ 4,414

  24. orchest/orchest
    Build data pipelines, the easy way 🛠️
    🔗 orchest.readthedocs.io/en/stable
    ⭐ 4,012

  25. adap/flower
    Flower: A Friendly Federated Learning Framework
    🔗 flower.ai
    ⭐ 3,736

  26. zenml-io/zenml
    ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
    🔗 zenml.io
    ⭐ 3,521

  27. polyaxon/polyaxon
    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
    🔗 polyaxon.com
    ⭐ 3,452

  28. kubeflow/pipelines
    Machine Learning Pipelines for Kubeflow
    🔗 www.kubeflow.org/docs/components/pipelines
    ⭐ 3,380

  29. ploomber/ploomber
    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
    🔗 docs.ploomber.io
    ⭐ 3,335

  30. towhee-io/towhee
    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
    🔗 towhee.io
    ⭐ 2,908

  31. determined-ai/determined
    Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
    🔗 determined.ai
    ⭐ 2,751

  32. leptonai/leptonai
    A Pythonic framework to simplify AI service building
    🔗 lepton.ai
    ⭐ 2,170

  33. meltano/meltano
    Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
    🔗 meltano.com
    ⭐ 1,504

  34. hi-primus/optimus
    🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
    🔗 hi-optimus.com
    ⭐ 1,429

  35. kubeflow/examples
    A repository to host extended examples and tutorials
    ⭐ 1,363

  36. dagworks-inc/hamilton
    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
    🔗 hamilton.dagworks.io/en/latest
    ⭐ 1,186

  37. dstackai/dstack
    dstack is an open-source engine for running GPU workloads. It simplifies development, training, and deployment of gen AI models on any cloud. Discord: https://discord.gg/u8SmfwPpMd
    🔗 dstack.ai
    ⭐ 1,001

  38. tensorflow/data-validation
    Library for exploring and validating machine learning data
    ⭐ 743

  39. kedro-org/kedro-viz
    Visualise your Kedro data and machine-learning pipelines and track your experiments.
    🔗 demo.kedro.org
    ⭐ 629

  40. nccr-itmo/FEDOT
    Automated modeling and machine learning framework FEDOT
    🔗 fedot.readthedocs.io
    ⭐ 590

Machine Learning - Reinforcement

Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF.

  1. openai/gym
    A toolkit for developing and comparing reinforcement learning algorithms.
    🔗 www.gymlibrary.dev
    ⭐ 33,546

  2. unity-technologies/ml-agents
    The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
    🔗 unity.com/products/machine-learning-agents
    ⭐ 16,021

  3. openai/baselines
    OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
    ⭐ 15,177

  4. google/dopamine
    Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
    🔗 github.com/google/dopamine
    ⭐ 10,319

  5. deepmind/pysc2
    StarCraft II Learning Environment
    ⭐ 7,880

  6. lucidrains/PaLM-rlhf-pytorch
    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
    ⭐ 7,547

  7. tensorlayer/TensorLayer
    Deep Learning and Reinforcement Learning Library for Scientists and Engineers
    🔗 tensorlayerx.com
    ⭐ 7,276

  8. thu-ml/tianshou
    An elegant PyTorch deep reinforcement learning library.
    🔗 tianshou.readthedocs.io
    ⭐ 7,159

  9. keras-rl/keras-rl
    Deep Reinforcement Learning for Keras.
    🔗 keras-rl.readthedocs.io
    ⭐ 5,473

  10. farama-foundation/Gymnasium
    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
    🔗 gymnasium.farama.org
    ⭐ 5,033

  11. facebookresearch/ReAgent
    A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
    🔗 reagent.ai
    ⭐ 3,505

  12. deepmind/dm_control
    Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
    ⭐ 3,448

  13. deepmind/acme
    A library of reinforcement learning components and agents
    ⭐ 3,323

  14. ai4finance-foundation/ElegantRL
    Massively Parallel Deep Reinforcement Learning. 🔥
    🔗 discord.gg/trsr8sxpw5
    ⭐ 3,319

  15. eureka-research/Eureka
    Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
    🔗 eureka-research.github.io
    ⭐ 2,470

  16. pettingzoo-team/PettingZoo
    An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
    🔗 pettingzoo.farama.org
    ⭐ 2,260

  17. kzl/decision-transformer
    Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
    ⭐ 2,059

  18. pytorch/rl
    A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
    🔗 pytorch.org/rl
    ⭐ 1,712

  19. anthropics/hh-rlhf
    Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
    🔗 arxiv.org/abs/2204.05862
    ⭐ 1,347

  20. humancompatibleai/imitation
    Clean PyTorch implementations of imitation and reward learning algorithms
    🔗 imitation.readthedocs.io
    ⭐ 1,086

  21. arise-initiative/robosuite
    robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
    🔗 robosuite.ai
    ⭐ 1,013

Natural Language Processing

Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.

  1. huggingface/transformers
    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
    🔗 huggingface.co/transformers
    ⭐ 120,404

  2. pytorch/fairseq
    Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
    ⭐ 28,712

  3. explosion/spaCy
    💫 Industrial-strength Natural Language Processing (NLP) in Python
    🔗 spacy.io
    ⭐ 28,292

  4. huggingface/datasets
    🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
    🔗 huggingface.co/docs/datasets
    ⭐ 18,046

  5. microsoft/unilm
    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
    🔗 aka.ms/generalai
    ⭐ 17,130

  6. rare-technologies/gensim
    Topic Modelling for Humans
    🔗 radimrehurek.com/gensim
    ⭐ 15,008

  7. myshell-ai/OpenVoice
    Instant voice cloning by MyShell.
    🔗 research.myshell.ai/open-voice
    ⭐ 14,311

  8. gunthercox/ChatterBot
    ChatterBot is a machine learning, conversational dialog engine for creating chat bots
    🔗 chatterbot.readthedocs.io
    ⭐ 13,827

  9. flairnlp/flair
    A very simple framework for state-of-the-art Natural Language Processing (NLP)
    🔗 flairnlp.github.io/flair
    ⭐ 13,416

  10. ukplab/sentence-transformers
    Multilingual Sentence & Image Embeddings with BERT
    🔗 www.sbert.net
    ⭐ 13,146

  11. nltk/nltk
    NLTK Source
    🔗 www.nltk.org
    ⭐ 12,809

  12. jina-ai/clip-as-service
    🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
    🔗 clip-as-service.jina.ai
    ⭐ 12,095

  13. allenai/allennlp
    An open-source NLP research library, built on PyTorch.
    🔗 www.allennlp.org
    ⭐ 11,653

  14. facebookresearch/ParlAI
    A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
    🔗 parl.ai
    ⭐ 10,407

  15. facebookresearch/seamless_communication
    Foundational Models for State-of-the-Art Speech and Text Translation
    ⭐ 9,721

  16. nvidia/NeMo
    NeMo: a framework for generative AI
    🔗 docs.nvidia.com/nemo-framework/user-guide/latest/index.html
    ⭐ 9,316

  17. google/sentencepiece
    Unsupervised text tokenizer for Neural Network-based text generation.
    ⭐ 8,986

  18. togethercomputer/OpenChatKit
    OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots
    ⭐ 8,973

  19. sloria/TextBlob
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
    🔗 textblob.readthedocs.io
    ⭐ 8,847

  20. doccano/doccano
    Open source annotation tool for machine learning practitioners.
    🔗 doccano.herokuapp.com
    ⭐ 8,753

  21. clips/pattern
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
    ๐Ÿ”— github.com/clips/pattern/wiki
    โญ 8,629

  22. openai/tiktoken
    tiktoken is a fast BPE tokeniser for use with OpenAI's models.
    โญ 8,559

  23. m-bain/whisperX
    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
    โญ 7,923

  24. espnet/espnet
    End-to-End Speech Processing Toolkit
    ๐Ÿ”— espnet.github.io/espnet
    โญ 7,629

  25. facebookresearch/nougat
    Implementation of Nougat Neural Optical Understanding for Academic Documents
    ๐Ÿ”— facebookresearch.github.io/nougat
    โญ 7,621

  26. speechbrain/speechbrain
    A PyTorch-based Speech Toolkit
    ๐Ÿ”— speechbrain.github.io
    โญ 7,280

  27. vikparuchuri/marker
    Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
    โญ 6,565

  28. deeppavlov/DeepPavlov
    An open source library for deep learning end-to-end dialog systems and chatbots.
    ๐Ÿ”— deeppavlov.ai
    โญ 6,480

  29. neuml/txtai
    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
    ๐Ÿ”— neuml.github.io/txtai
    โญ 6,465

  30. facebookresearch/metaseq
    A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.
    โญ 6,323

  31. kingoflolz/mesh-transformer-jax
    Model parallel transformers in JAX and Haiku
    โญ 6,182

  32. maartengr/BERTopic
    Leveraging BERT and c-TF-IDF to create easily interpretable topics.
    ๐Ÿ”— maartengr.github.io/bertopic
    โญ 5,288

  33. minimaxir/textgenrnn
    Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
    โญ 4,939

  34. vikparuchuri/surya
    OCR and line detection in 90+ languages
    โญ 4,786

  35. salesforce/CodeGen
    CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
    โญ 4,666

  36. facebookresearch/DrQA
    Reading Wikipedia to Answer Open-Domain Questions
    โญ 4,441

  37. prefecthq/marvin
    ✨ Build AI interfaces that spark joy
    ๐Ÿ”— askmarvin.ai
    โญ 4,404

  38. aiwaves-cn/agents
    An Open-source Framework for Autonomous Language Agents
    ๐Ÿ”— www.aiwaves-agents.com
    โญ 4,326

  39. layout-parser/layout-parser
    A Unified Toolkit for Deep Learning Based Document Image Analysis
    ๐Ÿ”— layout-parser.github.io
    โญ 4,282

  40. makcedward/nlpaug
    Data augmentation for NLP
    ๐Ÿ”— makcedward.github.io
    โญ 4,246

  41. thilinarajapakse/simpletransformers
    Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
    ๐Ÿ”— simpletransformers.ai
    โญ 3,939

  42. life4/textdistance
    Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
    โญ 3,266

  43. jsvine/markovify
    A simple, extensible Markov chain generator.
    โญ 3,252

  44. maartengr/KeyBERT
    Minimal keyword extraction with BERT
    ๐Ÿ”— maartengr.github.io/keybert
    โญ 3,103

  45. bytedance/lightseq
    LightSeq: A High Performance Library for Sequence Processing and Generation
    โญ 3,046

  46. errbotio/errbot
    Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.
    ๐Ÿ”— errbot.io
    โญ 3,034

  47. argilla-io/argilla
    ✨ Argilla: the open-source feedback platform for LLMs
    ๐Ÿ”— docs.argilla.io
    โญ 2,930

  48. huawei-noah/Pretrained-Language-Model
    Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
    โญ 2,924

  49. promptslab/Promptify
    Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
    ๐Ÿ”— discord.gg/m88xfymbk6
    โญ 2,901

  50. jbesomi/texthero
    Text preprocessing, representation and visualization from zero to hero.
    ๐Ÿ”— texthero.org
    โญ 2,851

  51. ddangelov/Top2Vec
    Top2Vec learns jointly embedded topic, document and word vectors.
    โญ 2,795

  52. huggingface/neuralcoref
    ✨ Fast Coreference Resolution in spaCy with Neural Networks
    ๐Ÿ”— huggingface.co/coref
    โญ 2,786

  53. neuralmagic/deepsparse
    Sparsity-aware deep learning inference runtime for CPUs
    ๐Ÿ”— neuralmagic.com/deepsparse
    โญ 2,778

  54. salesforce/CodeT5
    Home of CodeT5: Open Code LLMs for Code Understanding and Generation
    ๐Ÿ”— arxiv.org/abs/2305.07922
    โญ 2,500

  55. bigscience-workshop/promptsource
    Toolkit for creating, sharing and using natural language prompts.
    โญ 2,380

  56. jamesturk/jellyfish
    🪼 a python library for doing approximate and phonetic matching of strings.
    ๐Ÿ”— jamesturk.github.io/jellyfish
    โญ 1,960

  57. alibaba/EasyNLP
    EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
    โญ 1,884

  58. huggingface/setfit
    Efficient few-shot learning with Sentence Transformers
    ๐Ÿ”— hf.co/docs/setfit
    โญ 1,854

  59. thudm/P-tuning-v2
    An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
    โญ 1,820

  60. deepset-ai/FARM
    🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
    ๐Ÿ”— farm.deepset.ai
    โญ 1,716

  61. franck-dernoncourt/NeuroNER
    Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
    ๐Ÿ”— neuroner.com
    โญ 1,673

  62. featureform/featureform
    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
    ๐Ÿ”— www.featureform.com
    โญ 1,657

  63. plasticityai/magnitude
    A fast, efficient universal vector embedding utility package.
    โญ 1,612

  64. marella/ctransformers
    Python bindings for the Transformer models implemented in C/C++ using GGML library.
    โญ 1,606

  65. arxiv-vanity/arxiv-vanity
    Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
    ๐Ÿ”— www.arxiv-vanity.com
    โญ 1,585

  66. google-research/language
    Shared repository for open-sourced projects from the Google AI Language team.
    ๐Ÿ”— ai.google/research/teams/language
    โญ 1,546

  67. explosion/spacy-models
    💫 Models for the spaCy Natural Language Processing (NLP) library
    ๐Ÿ”— spacy.io
    โญ 1,482

  68. chrismattmann/tika-python
    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
    โญ 1,382

  69. gunthercox/chatterbot-corpus
    A multilingual dialog corpus
    ๐Ÿ”— chatterbot-corpus.readthedocs.io
    โญ 1,334

  70. jonasgeiping/cramming
    Cramming the training of a (BERT-type) language model into limited compute.
    โญ 1,208

  71. abertsch72/unlimiformer
    Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
    โญ 1,014

  72. norskregnesentral/skweak
    skweak: A software toolkit for weak supervision applied to NLP tasks
    โญ 908

  73. nomic-ai/nomic
    Interact, analyze and structure massive text, image, embedding, audio and video datasets
    ๐Ÿ”— atlas.nomic.ai
    โญ 907

  74. pemistahl/lingua-py
    The most accurate natural language detection library for Python, suitable for short text and mixed-language text
    โญ 796

  75. openai/grade-school-math
    GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems
    โญ 782

  76. explosion/spacy-streamlit
    👑 spaCy building blocks and visualizers for Streamlit apps
    ๐Ÿ”— share.streamlit.io/ines/spacy-streamlit-demo/master/app.py
    โญ 753

  77. paddlepaddle/RocketQA
    🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
    โญ 727

  78. intellabs/fastRAG
    Efficient Retrieval Augmentation and Generation Framework
    โญ 719

  79. explosion/spacy-stanza
    💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
    โญ 708

  80. keras-team/keras-nlp
    Modular Natural Language Processing workflows with Keras
    โญ 635

Packaging

Python packaging, dependency management and bundling.

  1. pyenv/pyenv
    pyenv lets you easily switch between multiple versions of Python.
    โญ 35,504

  2. python-poetry/poetry
    Python packaging and dependency management made easy
    ๐Ÿ”— python-poetry.org
    โญ 28,611

  3. pypa/pipenv
    A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
    ๐Ÿ”— pipenv.pypa.io
    โญ 24,457

  4. pyinstaller/pyinstaller
    Freeze (package) Python programs into stand-alone executables
    ๐Ÿ”— www.pyinstaller.org
    โญ 11,071

  5. mitsuhiko/rye
    a Hassle-Free Python Experience
    ๐Ÿ”— rye-up.com
    โญ 9,990

  6. pypa/pipx
    Install and Run Python Applications in Isolated Environments
    ๐Ÿ”— pipx.pypa.io
    โญ 8,182

  7. jazzband/pip-tools
    A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
    ๐Ÿ”— pip-tools.rtfd.io
    โญ 7,358

  8. pdm-project/pdm
    A modern Python package and dependency manager supporting the latest PEP standards
    ๐Ÿ”— pdm-project.org
    โญ 6,175

  9. conda/conda
    A system-level, binary package and environment manager running on all major operating systems and platforms.
    ๐Ÿ”— docs.conda.io/projects/conda
    โญ 5,987

  10. mamba-org/mamba
    The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
    ๐Ÿ”— mamba.readthedocs.io
    โญ 5,868

  11. astral-sh/uv
    An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
    ๐Ÿ”— astral.sh
    โญ 5,671

  12. pypa/hatch
    Modern, extensible Python project management
    ๐Ÿ”— hatch.pypa.io/latest
    โญ 5,088

  13. indygreg/PyOxidizer
    A modern Python application packaging and distribution tool
    โญ 5,063

  14. conda-forge/miniforge
    A conda-forge distribution.
    ๐Ÿ”— conda-forge.org/miniforge
    โญ 4,848

  15. pypa/virtualenv
    A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
    ๐Ÿ”— virtualenv.pypa.io
    โญ 4,642

  16. spack/spack
    A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
    ๐Ÿ”— spack.io
    โญ 3,852

  17. pantsbuild/pex
    A tool for generating .pex (Python EXecutable) files, lock files and venvs.
    ๐Ÿ”— docs.pex-tool.org
    โญ 2,428

  18. beeware/briefcase
    Tools to support converting a Python project into a standalone native application.
    ๐Ÿ”— briefcase.readthedocs.io
    โญ 2,247

  19. pypa/flit
    Simplified packaging of Python modules
    ๐Ÿ”— flit.pypa.io
    โญ 2,070

  20. linkedin/shiv
    shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.
    โญ 1,667

  21. marcelotduarte/cx_Freeze
    Create standalone, cross-platform executables from Python scripts, with the same performance as the original scripts.
    ๐Ÿ”— marcelotduarte.github.io/cx_freeze
    โญ 1,222

  22. ofek/pyapp
    Runtime installer for Python applications
    ๐Ÿ”— ofek.dev/pyapp
    โญ 975

  23. pypa/gh-action-pypi-publish
    The blessed :octocat: GitHub Action, for publishing your 📦 distribution files to PyPI: https://github.com/marketplace/actions/pypi-publish
    ๐Ÿ”— packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows
    โญ 795

  24. py2exe/py2exe
    Create standalone Windows programs from Python code
    ๐Ÿ”— www.py2exe.org
    โญ 722

  25. snok/install-poetry
    GitHub Action for installing and configuring Poetry
    โญ 506

Pandas

Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.

  1. pandas-dev/pandas
    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
    ๐Ÿ”— pandas.pydata.org
    โญ 41,260

  2. pola-rs/polars
    Dataframes powered by a multithreaded, vectorized query engine, written in Rust
    ๐Ÿ”— docs.pola.rs
    โญ 24,372

  3. duckdb/duckdb
    DuckDB is an in-process SQL OLAP Database Management System
    ๐Ÿ”— www.duckdb.org
    โญ 14,676

  4. ydataai/ydata-profiling
    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
    ๐Ÿ”— docs.profiling.ydata.ai
    โญ 11,832

  5. gventuri/pandas-ai
    Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
    ๐Ÿ”— pandas-ai.com
    โญ 9,781

  6. kanaries/pygwalker
    PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
    ๐Ÿ”— kanaries.net/home/pygwalker
    โญ 8,895

  7. rapidsai/cudf
    cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data
    ๐Ÿ”— docs.rapids.ai/api/cudf/stable
    โญ 7,041

  8. aws/aws-sdk-pandas
    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
    ๐Ÿ”— aws-sdk-pandas.readthedocs.io
    โญ 3,745

  9. nalepae/pandarallel
    A simple and efficient tool to parallelize Pandas operations on all available CPUs
    ๐Ÿ”— nalepae.github.io/pandarallel
    โญ 3,429

  10. blaze/blaze
    NumPy and Pandas interface to Big Data
    ๐Ÿ”— blaze.pydata.org
    โญ 3,179

  11. adamerose/PandasGUI
    A GUI for Pandas DataFrames
    โญ 3,096

  12. unionai-oss/pandera
    A light-weight, flexible, and expressive statistical data testing library
    ๐Ÿ”— www.union.ai/pandera
    โญ 2,845

  13. pydata/pandas-datareader
    Extract data from a wide range of Internet sources into a pandas DataFrame.
    ๐Ÿ”— pydata.github.io/pandas-datareader/stable/index.html
    โญ 2,789

  14. scikit-learn-contrib/sklearn-pandas
    Pandas integration with sklearn
    โญ 2,774

  15. jmcarpenter2/swifter
    A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
    โญ 2,422

  16. fugue-project/fugue
    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
    ๐Ÿ”— fugue-tutorials.readthedocs.io
    โญ 1,830

  17. delta-io/delta-rs
    A native Rust library for Delta Lake, with bindings into Python
    ๐Ÿ”— delta-io.github.io/delta-rs
    โญ 1,693

  18. pyjanitor-devs/pyjanitor
    Clean APIs for data cleaning. Python implementation of R package Janitor
    ๐Ÿ”— pyjanitor-devs.github.io/pyjanitor
    โญ 1,254

  19. machow/siuba
    Python library for using dplyr like syntax with pandas and SQL
    ๐Ÿ”— siuba.org
    โญ 1,086

  20. eventual-inc/Daft
    Distributed DataFrame for Python designed for the cloud, powered by Rust
    ๐Ÿ”— getdaft.io
    โญ 1,039

  21. tkrabel/bamboolib
    bamboolib - a GUI for pandas DataFrames
    ๐Ÿ”— bamboolib.com
    โญ 925

  22. holoviz/hvplot
    A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
    ๐Ÿ”— hvplot.holoviz.org
    โญ 902

  23. renumics/spotlight
    Interactively explore unstructured datasets from your dataframe.
    ๐Ÿ”— renumics.com
    โญ 857

Performance

Performance, parallelisation and low level libraries.

  1. celery/celery
    Distributed Task Queue (development branch)
    ๐Ÿ”— docs.celeryq.dev
    โญ 23,061

  2. google/flatbuffers
    FlatBuffers: Memory Efficient Serialization Library
    ๐Ÿ”— flatbuffers.dev
    โญ 21,763

  3. pybind/pybind11
    Seamless operability between C++11 and Python
    ๐Ÿ”— pybind11.readthedocs.io
    โญ 14,459

  4. exaloop/codon
    A high-performance, zero-overhead, extensible Python compiler using LLVM
    ๐Ÿ”— docs.exaloop.io/codon
    โญ 13,684

  5. dask/dask
    Parallel computing with task scheduling
    ๐Ÿ”— dask.org
    โญ 11,794

  6. modin-project/modin
    Modin: Scale your Pandas workflows by changing a single line of code
    ๐Ÿ”— modin.readthedocs.io
    โญ 9,330

  7. numba/numba
    NumPy aware dynamic Python compiler using LLVM
    ๐Ÿ”— numba.pydata.org
    โญ 9,250

  8. nebuly-ai/nebuly
    The user analytics platform for LLMs
    ๐Ÿ”— www.nebuly.com
    โญ 8,345

  9. vaexio/vaex
    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
    ๐Ÿ”— vaex.io
    โญ 8,136

  10. mher/flower
    Real-time monitor and web admin for Celery distributed task queue
    ๐Ÿ”— flower.readthedocs.io
    โญ 6,070

  11. python-trio/trio
    Trio – a friendly Python library for async concurrency and I/O
    ๐Ÿ”— trio.readthedocs.io
    โญ 5,771

  12. ultrajson/ultrajson
    Ultra fast JSON decoder and encoder written in C with Python bindings
    ๐Ÿ”— pypi.org/project/ujson
    โญ 4,202

  13. facebookincubator/cinder
    Cinder is Meta's internal performance-oriented production version of CPython.
    ๐Ÿ”— trycinder.com
    โญ 3,341

  14. tlkh/asitop
    Perf monitoring CLI tool for Apple Silicon
    ๐Ÿ”— tlkh.github.io/asitop
    โญ 2,549

  15. ipython/ipyparallel
    IPython Parallel: Interactive Parallel Computing in Python
    ๐Ÿ”— ipyparallel.readthedocs.io
    โญ 2,532

  16. h5py/h5py
    HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
    ๐Ÿ”— www.h5py.org
    โญ 1,978

  17. intel/intel-extension-for-transformers
    ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡
    โญ 1,797

  18. faster-cpython/ideas
    Discussion and work tracker for Faster CPython project.
    โญ 1,630

  19. airtai/faststream
    FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
    ๐Ÿ”— faststream.airt.ai/latest
    โญ 1,565

  20. agronholm/anyio
    High level asynchronous concurrency and networking framework that works on top of either trio or asyncio
    โญ 1,526

  21. dask/distributed
    A distributed task scheduler for Dask
    ๐Ÿ”— distributed.dask.org
    โญ 1,522

  22. tiangolo/asyncer
    Asyncer, async and await, focused on developer experience.
    ๐Ÿ”— asyncer.tiangolo.com
    โญ 1,352

  23. nschloe/perfplot
    📈 Performance analysis for Python snippets
    โญ 1,270

  24. intel/intel-extension-for-pytorch
    A Python package that extends the official PyTorch to easily improve performance on Intel platforms
    โญ 1,207

  25. intel/scikit-learn-intelex
    Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
    ๐Ÿ”— intel.github.io/scikit-learn-intelex
    โญ 1,128

  26. markshannon/faster-cpython
    How to make CPython faster.
    โญ 934

  27. zerointensity/pointers.py
    Bringing the hell of pointers to Python.
    ๐Ÿ”— pointers.zintensity.dev
    โญ 855

  28. brandtbucher/specialist
    Visualize CPython 3.11's specializing, adaptive interpreter. 🔥
    โญ 608

Profiling

Memory and CPU/GPU profiling tools and libraries.

  1. bloomberg/memray
    Memray is a memory profiler for Python
    ๐Ÿ”— bloomberg.github.io/memray
    โญ 12,239

  2. benfred/py-spy
    Sampling profiler for Python programs
    โญ 11,555

  3. plasma-umass/scalene
    Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
    โญ 10,924

  4. joerick/pyinstrument
    🚴 Call stack profiler for Python. Shows you why your code is slow!
    ๐Ÿ”— pyinstrument.readthedocs.io
    โญ 5,966

  5. pythonprofilers/memory_profiler
    Monitor Memory usage of Python code
    ๐Ÿ”— pypi.python.org/pypi/memory_profiler
    โญ 4,149

  6. gaogaotiantian/viztracer
    VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
    ๐Ÿ”— viztracer.readthedocs.io
    โญ 4,117

  7. reloadware/reloadium
    Hot Reloading, Profiling and AI debugging for Python
    ๐Ÿ”— reloadium.io
    โญ 2,647

  8. pyutils/line_profiler
    Line-by-line profiling for Python
    โญ 2,371

  9. jiffyclub/snakeviz
    An in-browser Python profile viewer
    ๐Ÿ”— jiffyclub.github.io/snakeviz
    โญ 2,182

  10. p403n1x87/austin
    Python frame stack sampler for CPython
    ๐Ÿ”— pypi.org/project/austin-dist
    โญ 1,331

  11. pythonspeed/filprofiler
    A Python memory profiler for data processing and scientific computing applications
    ๐Ÿ”— pythonspeed.com/products/filmemoryprofiler
    โญ 806

Security

Security related libraries: vulnerability discovery, SQL injection, environment auditing.

  1. swisskyrepo/PayloadsAllTheThings
    A list of useful payloads and bypass for Web Application Security and Pentest/CTF
    ๐Ÿ”— swisskyrepo.github.io/payloadsallthethings
    โญ 55,404

  2. certbot/certbot
    Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
    โญ 30,623

  3. sqlmapproject/sqlmap
    Automatic SQL injection and database takeover tool
    ๐Ÿ”— sqlmap.org
    โญ 29,958

  4. aquasecurity/trivy
    Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
    ๐Ÿ”— aquasecurity.github.io/trivy
    โญ 20,488

  5. bridgecrewio/checkov
    Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
    ๐Ÿ”— www.checkov.io
    โญ 6,370

  6. nccgroup/ScoutSuite
    Multi-Cloud Security Auditing Tool
    โญ 6,014

  7. pycqa/bandit
    Bandit is a tool designed to find common security issues in Python code.
    ๐Ÿ”— bandit.readthedocs.io
    โญ 5,810

  8. stamparm/maltrail
    Malicious traffic detection system
    โญ 5,618

  9. rhinosecuritylabs/pacu
    The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
    ๐Ÿ”— rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework
    โญ 3,926

  10. pyupio/safety
    Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
    ๐Ÿ”— safetycli.com/product/safety-cli
    โญ 1,596

  11. trailofbits/pip-audit
    Audits Python environments and dependency trees for known vulnerabilities
    ๐Ÿ”— pypi.org/project/pip-audit
    โญ 888

  12. fadi002/de4py
    toolkit for python reverse engineering
    ๐Ÿ”— de4py.000.pe
    โญ 605

Simulation

Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover.

  1. atsushisakai/PythonRobotics
    Python sample codes for robotics algorithms.
    ๐Ÿ”— atsushisakai.github.io/pythonrobotics
    โญ 21,074

  2. bulletphysics/bullet3
    Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
    ๐Ÿ”— bulletphysics.org
    โญ 11,640

  3. isl-org/Open3D
    Open3D: A Modern Library for 3D Data Processing
    ๐Ÿ”— www.open3d.org
    โญ 10,155

  4. qiskit/qiskit
    Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
    ๐Ÿ”— www.ibm.com/quantum/qiskit
    โญ 4,342

  5. astropy/astropy
    Astronomy and astrophysics core library
    ๐Ÿ”— www.astropy.org
    โญ 4,122

  6. quantumlib/Cirq
    A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.
    โญ 4,068

  7. openai/mujoco-py
    MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
    โญ 2,692

  8. rdkit/rdkit
    The official sources for the RDKit library
    โญ 2,346

  9. projectmesa/mesa
    Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
    โญ 2,125

  10. google/brax
    Massively parallel rigidbody physics simulation on accelerator hardware.
    โญ 1,983

  11. quantecon/QuantEcon.py
    A community based Python library for quantitative economics
    ๐Ÿ”— quantecon.org/quantecon-py
    โญ 1,827

  12. microsoft/PromptCraft-Robotics
    Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
    ๐Ÿ”— aka.ms/chatgpt-robotics
    โญ 1,628

  13. facebookresearch/habitat-lab
    A modular high-level library to train embodied AI agents across a variety of tasks and environments.
    ๐Ÿ”— aihabitat.org
    โญ 1,592

  14. nvidia/warp
    A Python framework for high performance GPU simulation and graphics
    ๐Ÿ”— nvidia.github.io/warp
    โญ 1,535

  15. nvidia-omniverse/IsaacGymEnvs
    Isaac Gym Reinforcement Learning Environments
    โญ 1,434

  16. deepmodeling/deepmd-kit
    A deep learning package for many-body potential energy representation and molecular dynamics
    ๐Ÿ”— docs.deepmodeling.com/projects/deepmd
    โญ 1,313

  17. sail-sg/envpool
    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
    ๐Ÿ”— envpool.readthedocs.io
    โญ 978

  18. a-r-j/graphein
    Protein Graph Library
    ๐Ÿ”— graphein.ai
    โญ 954

  19. hardmaru/estool
    Evolution Strategies Tool
    โญ 917

  20. facebookresearch/fairo
    A modular embodied agent architecture and platform for building embodied agents
    โญ 827

  21. google/evojax
    EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library
    โญ 748

  22. google-deepmind/materials_discovery
    Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science.
    โญ 728

  23. bowang-lab/scGPT
    scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
    ๐Ÿ”— scgpt.readthedocs.io/en/latest
    โญ 618

  24. nvidia-omniverse/OmniIsaacGymEnvs
    Reinforcement Learning Environments for Omniverse Isaac Gym
    โญ 568

  25. nvidia-omniverse/orbit
    Unified framework for robot learning built on NVIDIA Isaac Sim
    ๐Ÿ”— isaac-orbit.github.io/orbit
    โญ 562

Study

Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.

  1. thealgorithms/Python
    All Algorithms implemented in Python
    ๐Ÿ”— the-algorithms.com
    โญ 175,886

  2. jakevdp/PythonDataScienceHandbook
    Python Data Science Handbook: full text in Jupyter Notebooks
    ๐Ÿ”— jakevdp.github.io/pythondatasciencehandbook
    โญ 40,899

  3. realpython/python-guide
    Python best practices guidebook, written for humans.
    ๐Ÿ”— docs.python-guide.org
    โญ 27,282

  4. mlabonne/llm-course
    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
    ๐Ÿ”— mlabonne.github.io/blog
    โญ 23,958

  5. microsoft/generative-ai-for-beginners
    18 Lessons, Get Started Building with Generative AI
    ๐Ÿ”— microsoft.github.io/generative-ai-for-beginners
    โญ 23,917

  6. christoschristofidis/awesome-deep-learning
    A curated list of awesome Deep Learning tutorials, projects and communities.
    โญ 22,407

  7. wesm/pydata-book
    Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
    โญ 20,945

  8. d2l-ai/d2l-en
    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
    ๐Ÿ”— d2l.ai
    โญ 20,901

  9. fchollet/deep-learning-with-python-notebooks
    Jupyter notebooks for the code samples of the book "Deep Learning with Python"
    โญ 17,604

  10. microsoft/recommenders
    Best Practices on Recommendation Systems
    ๐Ÿ”— recommenders-team.github.io/recommenders/intro.html
    โญ 17,563

  11. graykode/nlp-tutorial
    Natural Language Processing Tutorial for Deep Learning Researchers
    ๐Ÿ”— www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying
    โญ 13,456

  12. shangtongzhang/reinforcement-learning-an-introduction
    Python Implementation of Reinforcement Learning: An Introduction
    โญ 13,038

  13. hannibal046/Awesome-LLM
    Awesome-LLM: a curated list of Large Language Models
    โญ 12,378

  14. eugeneyan/open-llms
    📋 A list of open LLMs available for commercial use.
    โญ 9,526

  15. openai/spinningup
    An educational resource to help anyone learn deep reinforcement learning.
    ๐Ÿ”— spinningup.openai.com
    โญ 9,414

  16. karpathy/nn-zero-to-hero
    Neural Networks: Zero to Hero
    โญ 9,373

  17. mooler0410/LLMsPracticalGuide
    A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
    ๐Ÿ”— arxiv.org/abs/2304.13712v2
    โญ 8,037

  18. karpathy/micrograd
    A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
    โญ 7,429

  19. firmai/industry-machine-learning
    A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
    ๐Ÿ”— www.linkedin.com/company/firmai
    โญ 6,974

  20. nielsrogge/Transformers-Tutorials
    This repository contains demos I made with the Transformers library by HuggingFace.
    โญ 6,924

  21. mrdbourke/pytorch-deep-learning
    Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
    ๐Ÿ”— learnpytorch.io
    โญ 6,799

  22. zhanymkanov/fastapi-best-practices
    FastAPI Best Practices and Conventions we used at our startup
    โญ 6,300

  23. gkamradt/langchain-tutorials
    Overview and tutorial of the LangChain Library
    โญ 5,898

  24. udacity/deep-learning-v2-pytorch
    Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
    โญ 5,132

  25. srush/GPU-Puzzles
    Teaching beginner GPU programming in a completely interactive fashion
    โญ 4,778

  26. mrdbourke/tensorflow-deep-learning
    All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
    ๐Ÿ”— dbourke.link/ztmtfcourse
    โญ 4,728

  27. neetcode-gh/leetcode
    Leetcode solutions for NeetCode.io
    โญ 4,621

  28. udlbook/udlbook
    Understanding Deep Learning - Simon J.D. Prince
    โญ 4,367

  29. timofurrer/awesome-asyncio
    A curated list of awesome Python asyncio frameworks, libraries, software and resources
    โญ 4,332

  30. zotroneneis/machine_learning_basics
    Plain python implementations of basic machine learning algorithms
    โญ 4,170

  31. roboflow/notebooks
    Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
    ๐Ÿ”— roboflow.com/models
    โญ 3,790

  32. huggingface/deep-rl-class
    This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
    โญ 3,500

  33. cosmicpython/book
    A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
    ๐Ÿ”— www.cosmicpython.com
    โญ 3,188

  34. huggingface/diffusion-models-class
    Materials for the Hugging Face Diffusion Models Course
    โญ 3,028

  35. alirezadir/Machine-Learning-Interviews
    This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
    โญ 2,939

  36. promptslab/Awesome-Prompt-Engineering
    This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
    🔗 discord.gg/m88xfymbk6
    ⭐ 2,926

  37. fluentpython/example-code-2e
    Example code for Fluent Python, 2nd edition (O'Reilly 2022)
    ๐Ÿ”— amzn.to/3j48u2j
    โญ 2,776

  38. rasbt/machine-learning-book
    Code Repository for Machine Learning with PyTorch and Scikit-Learn
    ๐Ÿ”— sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn
    โญ 2,653

  39. amanchadha/coursera-deep-learning-specialization
    Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv...
    โญ 2,538

  40. mrdbourke/zero-to-mastery-ml
    All course materials for the Zero to Mastery Machine Learning and Data Science course.
    ๐Ÿ”— dbourke.link/ztmmlcourse
    โญ 2,459

  41. krzjoa/awesome-python-data-science
    Probably the best curated list of data science software in Python.
    ๐Ÿ”— krzjoa.github.io/awesome-python-data-science
    โญ 2,225

  42. cgpotts/cs224u
    Code for CS224u: Natural Language Understanding
    โญ 2,044

  43. cerlymarco/MEDIUM_NoteBook
    Repository containing notebooks of my posts on Medium
    โญ 1,996

  44. trananhkma/fucking-awesome-python
    awesome-python with :octocat: ⭐ and 🍴
    ⭐ 1,951

  45. gerdm/prml
    Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
    โญ 1,708

  46. atcold/NYU-DLSP21
    NYU Deep Learning Spring 2021
    ๐Ÿ”— atcold.github.io/nyu-dlsp21
    โญ 1,483

  47. chandlerbang/awesome-self-supervised-gnn
    Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
    โญ 1,396

  48. patrickloeber/MLfromscratch
    Machine Learning algorithm implementations from scratch.
    โญ 1,115

  49. jackhidary/quantumcomputingbook
    Companion site for the textbook Quantum Computing: An Applied Approach
    โญ 744

  50. davidadsp/Generative_Deep_Learning_2nd_Edition
    The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
    ๐Ÿ”— www.oreilly.com/library/view/generative-deep-learning/9781098134174
    โญ 734

  51. rasbt/stat453-deep-learning-ss20
    STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2020)
    ๐Ÿ”— pages.stat.wisc.edu/~sraschka/teaching/stat453-ss2020
    โญ 543

Template

Template tools and libraries: cookiecutter repos, generators, quick-starts.

  1. cookiecutter/cookiecutter
    A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
    ๐Ÿ”— pypi.org/project/cookiecutter
    โญ 21,217

  2. tiangolo/full-stack-fastapi-postgresql
    Full stack, modern web application generator. Using FastAPI, PostgreSQL as database, Docker, automatic HTTPS and more.
    โญ 14,611

  3. drivendata/cookiecutter-data-science
    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
    ๐Ÿ”— drivendata.github.io/cookiecutter-data-science
    โญ 7,417

  4. buuntu/fastapi-react
    🚀 Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker
    ⭐ 1,982

  5. pyscaffold/pyscaffold
    🛠 Python project template generator with batteries included
    🔗 pyscaffold.org
    ⭐ 1,968

  6. cjolowicz/cookiecutter-hypermodern-python
    Cookiecutter template for a Python package based on the Hypermodern Python article series.
    ๐Ÿ”— cookiecutter-hypermodern-python.readthedocs.io
    โญ 1,679

  7. tezromach/python-package-template
    🚀 Your next Python package needs a bleeding-edge project structure.
    ⭐ 1,066

  8. martinheinz/python-project-blueprint
    Blueprint/Boilerplate For Python Projects
    โญ 933

Terminal

Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.

  1. willmcgugan/rich
    Rich is a Python library for rich text and beautiful formatting in the terminal.
    ๐Ÿ”— rich.readthedocs.io/en/latest
    โญ 46,427

  2. tqdm/tqdm
    ⚡ A Fast, Extensible Progress Bar for Python and CLI
    🔗 tqdm.github.io
    ⭐ 27,029

  3. google/python-fire
    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
    โญ 25,981

  4. willmcgugan/textual
    The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
    ๐Ÿ”— textual.textualize.io
    โญ 22,940

  5. pallets/click
    Python composable command line interface toolkit
    ๐Ÿ”— click.palletsprojects.com
    โญ 14,814

  6. tiangolo/typer
    Typer, build great CLIs. Easy to code. Based on Python type hints.
    ๐Ÿ”— typer.tiangolo.com
    โญ 13,065

  7. saulpw/visidata
    A terminal spreadsheet multitool for discovering and arranging data
    ๐Ÿ”— visidata.org
    โญ 7,271

  8. manrajgrover/halo
    💫 Beautiful spinners for terminal, IPython and Jupyter
    ⭐ 2,824

  9. urwid/urwid
    Console user interface library for Python (official repo)
    ๐Ÿ”— urwid.org
    โญ 2,693

  10. textualize/trogon
    Easily turn your Click CLI into a powerful terminal application
    โญ 2,275

  11. tconbeer/harlequin
    The SQL IDE for Your Terminal.
    ๐Ÿ”— harlequin.sh
    โญ 2,126

  12. tmbo/questionary
    Python library to build pretty command line user prompts ✨ Easy to use multi-select lists, confirmations, free text prompts ...
    ⭐ 1,358

  13. jazzband/prettytable
    Display tabular data in a visually appealing ASCII table format
    ๐Ÿ”— pypi.org/project/prettytable
    โญ 1,213

  14. 1j01/textual-paint
    🎨 MS Paint in your terminal.
    🔗 pypi.org/project/textual-paint
    ⭐ 876

Testing

Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.

  1. locustio/locust
    Write scalable load tests in plain Python 🚗💨
    ⭐ 23,158

  2. pytest-dev/pytest
    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
    ๐Ÿ”— pytest.org
    โญ 11,134

  3. microsoft/playwright-python
    Python version of the Playwright testing and automation library.
    ๐Ÿ”— playwright.dev/python
    โญ 10,151

  4. robotframework/robotframework
    Generic automation framework for acceptance testing and RPA
    ๐Ÿ”— robotframework.org
    โญ 8,806

  5. hypothesisworks/hypothesis
    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
    ๐Ÿ”— hypothesis.works
    โญ 7,149

  6. newsapps/beeswithmachineguns
    A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
    ๐Ÿ”— apps.chicagotribune.com
    โญ 6,378

  7. getsentry/responses
    A utility for mocking out the Python Requests library.
    โญ 4,010

  8. seleniumbase/SeleniumBase
    Browser automation framework for testing with Selenium, Python, and pytest. Includes a Dashboard, a Recorder for generating tests, Undetected Mode, and more.
    ๐Ÿ”— seleniumbase.io
    โญ 3,950

  9. spulec/freezegun
    Let your Python tests travel through time
    โญ 3,918

  10. tox-dev/tox
    Command line driven CI frontend and development task automation tool.
    ๐Ÿ”— tox.wiki
    โญ 3,452

  11. behave/behave
    BDD, Python style.
    ๐Ÿ”— behave.readthedocs.io/en/latest
    โญ 3,024

  12. nedbat/coveragepy
    The code coverage tool for Python
    ๐Ÿ”— coverage.readthedocs.io
    โญ 2,762

  13. cobrateam/splinter
    splinter - python test framework for web applications
    ๐Ÿ”— splinter.readthedocs.org/en/stable/index.html
    โญ 2,676

  14. kevin1024/vcrpy
    Automatically mock your HTTP interactions to simplify and speed up testing
    โญ 2,569

  15. pytest-dev/pytest-testinfra
    With Testinfra you can write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
    ๐Ÿ”— testinfra.readthedocs.io
    โญ 2,298

  16. pytest-dev/pytest-mock
    Thin wrapper around the mock package for easier use with pytest
    🔗 pytest-mock.readthedocs.io/en/latest
    ⭐ 1,724

  17. pytest-dev/pytest-cov
    Coverage plugin for pytest.
    โญ 1,628

  18. pytest-dev/pytest-xdist
    pytest plugin for distributed testing and loop-on-failures testing modes.
    ๐Ÿ”— pytest-xdist.readthedocs.io
    โญ 1,311

  19. pytest-dev/pytest-asyncio
    Asyncio support for pytest
    ๐Ÿ”— pytest-asyncio.readthedocs.io
    โญ 1,292

  20. confident-ai/deepeval
    The Evaluation Framework for LLMs
    ๐Ÿ”— docs.confident-ai.com
    โญ 1,205

  21. taverntesting/tavern
    A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
    ๐Ÿ”— taverntesting.github.io
    โญ 978

Machine Learning - Time Series

Machine learning and classical time series libraries: forecasting, seasonality, anomaly detection, econometrics.

  1. facebook/prophet
    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
    ๐Ÿ”— facebook.github.io/prophet
    โญ 17,484

  2. blue-yonder/tsfresh
    Automatic extraction of relevant features from time series
    🔗 tsfresh.readthedocs.io
    ⭐ 8,000

  3. sktime/sktime
    A unified framework for machine learning with time series
    ๐Ÿ”— www.sktime.net
    โญ 7,209

  4. unit8co/darts
    A python library for user-friendly forecasting and anomaly detection on time series.
    ๐Ÿ”— unit8co.github.io/darts
    โญ 7,012

  5. facebookresearch/Kats
    Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.
    โญ 4,682

  6. awslabs/gluonts
    Probabilistic time series modeling in Python
    ๐Ÿ”— ts.gluon.ai
    โญ 4,104

  7. nixtla/statsforecast
    Lightning ⚡️ fast forecasting with statistical and econometric models.
    🔗 nixtlaverse.nixtla.io/statsforecast
    ⭐ 3,395

  8. salesforce/Merlion
    Merlion: A Machine Learning Framework for Time Series Intelligence
    โญ 3,214

  9. tdameritrade/stumpy
    STUMPY is a powerful and scalable Python library for modern time series analysis
    ๐Ÿ”— stumpy.readthedocs.io/en/latest
    โญ 2,924

  10. rjt1990/pyflux
    Open source time series library for Python
    โญ 2,078

  11. aistream-peelout/flow-forecast
    Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
    ๐Ÿ”— flow-forecast.atlassian.net/wiki/spaces/ff/overview
    โญ 1,792

  12. uber/orbit
    A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
    ๐Ÿ”— orbit-ml.readthedocs.io/en/stable
    โญ 1,781

  13. alkaline-ml/pmdarima
    A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
    ๐Ÿ”— www.alkaline-ml.com/pmdarima
    โญ 1,491

  14. winedarksea/AutoTS
    Automated Time Series Forecasting
    โญ 952

  15. autoviml/Auto_TS
    Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome.
    โญ 663

  16. time-series-foundation-models/lag-llama
    Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
    โญ 623

  17. google/temporian
    Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
    🔗 temporian.readthedocs.io
    ⭐ 544

Typing

Typing libraries: static and run-time type checking, annotations.

  1. python/mypy
    Optional static typing for Python
    ๐Ÿ”— www.mypy-lang.org
    โญ 17,201

  2. microsoft/pyright
    Static Type Checker for Python
    โญ 11,684

  3. facebook/pyre-check
    Performant type-checking for python.
    ๐Ÿ”— pyre-check.org
    โญ 6,631

  4. python-attrs/attrs
    Python Classes Without Boilerplate
    ๐Ÿ”— www.attrs.org
    โญ 5,020

  5. google/pytype
    A static type analyzer for Python code
    ๐Ÿ”— google.github.io/pytype
    โญ 4,484

  6. instagram/MonkeyType
    A Python library that generates static type annotations by collecting runtime types
    โญ 4,479

  7. python/typeshed
    Collection of library stubs for Python, with static types
    โญ 3,964

  8. microsoft/pylance-release
    Fast, feature-rich language support for Python. Documentation and issues for Pylance.
    โญ 1,640

  9. agronholm/typeguard
    Run-time type checker for Python
    โญ 1,401

  10. patrick-kidger/torchtyping
    Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.
    โญ 1,314

Utility

General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.

  1. home-assistant/core
    🏡 Open source home automation that puts local control and privacy first.
    🔗 www.home-assistant.io
    ⭐ 67,405

  2. yt-dlp/yt-dlp
    A youtube-dl fork with additional features and fixes
    ๐Ÿ”— discord.gg/h5mncfw63r
    โญ 66,417

  3. python/cpython
    The Python programming language
    ๐Ÿ”— www.python.org
    โญ 58,457

  4. localstack/localstack
    💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
    🔗 localstack.cloud
    ⭐ 51,366

  5. faif/python-patterns
    A collection of design patterns/idioms in Python
    โญ 39,064

  6. mingrammer/diagrams
    🎨 Diagram as Code for prototyping cloud system architectures
    🔗 diagrams.mingrammer.com
    ⭐ 34,201

  7. ggerganov/whisper.cpp
    Port of OpenAI's Whisper model in C/C++
    โญ 28,090

  8. keon/algorithms
    Minimal examples of data structures and algorithms in Python
    โญ 23,377

  9. norvig/pytudes
    Python programs, usually short, of considerable difficulty, to perfect particular skills.
    โญ 22,210

  10. facebookresearch/audiocraft
    Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
    โญ 18,827

  11. openai/openai-python
    The official Python library for the OpenAI API
    ๐Ÿ”— pypi.org/project/openai
    โญ 18,483

  12. micropython/micropython
    MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
    ๐Ÿ”— micropython.org
    โญ 17,990

  13. mkdocs/mkdocs
    Project documentation with Markdown.
    ๐Ÿ”— www.mkdocs.org
    โญ 17,939

  14. pydantic/pydantic
    Data validation using Python type hints
    ๐Ÿ”— docs.pydantic.dev
    โญ 17,680

  15. delgan/loguru
    Python logging made (stupidly) simple
    โญ 17,446

  16. squidfunk/mkdocs-material
    Documentation that simply works
    ๐Ÿ”— squidfunk.github.io/mkdocs-material
    โญ 17,422

  17. rustpython/RustPython
    A Python Interpreter written in Rust
    ๐Ÿ”— rustpython.github.io
    โญ 17,166

  18. modularml/mojo
    The Mojo Programming Language
    ๐Ÿ”— docs.modular.com/mojo
    โญ 17,034

  19. kivy/kivy
    Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
    ๐Ÿ”— kivy.org
    โญ 16,725

  20. ipython/ipython
    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
    ๐Ÿ”— ipython.readthedocs.org
    โญ 16,089

  21. alievk/avatarify-python
    Avatars for Zoom, Skype and other video-conferencing apps.
    โญ 16,025

  22. blakeblackshear/frigate
    NVR with realtime local object detection for IP cameras
    ๐Ÿ”— frigate.video
    โญ 13,667

  23. zulko/moviepy
    Video editing with Python
    ๐Ÿ”— zulko.github.io/moviepy
    โญ 11,505

  24. python-pillow/Pillow
    Python Imaging Library (Fork)
    ๐Ÿ”— python-pillow.org
    โญ 11,481

  25. dbader/schedule
    Python job scheduling for humans.
    ๐Ÿ”— schedule.readthedocs.io
    โญ 11,371

  26. pyodide/pyodide
    Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
    ๐Ÿ”— pyodide.org/en/stable
    โญ 11,056

  27. nuitka/Nuitka
    Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 3.10, and 3.11. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
    ๐Ÿ”— nuitka.net
    โญ 10,471

  28. pyo3/pyo3
    Rust bindings for the Python interpreter
    ๐Ÿ”— pyo3.rs
    โญ 10,451

  29. ninja-build/ninja
    Ninja is a small build system with a focus on speed.
    ๐Ÿ”— ninja-build.org
    โญ 10,302

  30. pytube/pytube
    A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
    ๐Ÿ”— pytube.io
    โญ 9,997

  31. openai/triton
    Development repository for the Triton language and compiler
    ๐Ÿ”— triton-lang.org
    โญ 9,981

  32. caronc/apprise
    Apprise - Push Notifications that work with just about every platform!
    ๐Ÿ”— hub.docker.com/r/caronc/apprise
    โญ 9,949

  33. magicstack/uvloop
    Ultra fast asyncio event loop.
    โญ 9,846

  34. secdev/scapy
    Scapy: the Python-based interactive packet manipulation program & library. Supports Python 2 & Python 3.
    ๐Ÿ”— scapy.net
    โญ 9,821

  35. pallets/jinja
    A very fast and expressive template engine.
    ๐Ÿ”— jinja.palletsprojects.com
    โญ 9,794

  36. asweigart/pyautogui
    A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
    โญ 9,264

  37. aws/serverless-application-model
    The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
    🔗 aws.amazon.com/serverless/sam
    ⭐ 9,205

  38. paramiko/paramiko
    The leading native Python SSHv2 protocol library.
    ๐Ÿ”— paramiko.org
    โญ 8,725

  39. cython/cython
    The most widely used Python to C compiler
    ๐Ÿ”— cython.org
    โญ 8,718

  40. boto/boto3
    AWS SDK for Python
    ๐Ÿ”— aws.amazon.com/sdk-for-python
    โญ 8,582

  41. arrow-py/arrow
    🏹 Better dates & times for Python
    🔗 arrow.readthedocs.io
    ⭐ 8,494

  42. facebookresearch/hydra
    Hydra is a framework for elegantly configuring complex applications
    ๐Ÿ”— hydra.cc
    โญ 7,965

  43. xonsh/xonsh
    🐚 Python-powered, cross-platform, Unix-gazing shell.
    🔗 xon.sh
    ⭐ 7,792

  44. eternnoir/pyTelegramBotAPI
    Python Telegram bot api.
    โญ 7,521

  45. kellyjonbrazil/jc
    CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.
    โญ 7,395

  46. googleapis/google-api-python-client
    🐍 The official Python client library for Google's discovery based APIs.
    🔗 googleapis.github.io/google-api-python-client/docs
    ⭐ 7,240

  47. py-pdf/pypdf
    A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
    ๐Ÿ”— pypdf.readthedocs.io/en/latest
    โญ 7,053

  48. theskumar/python-dotenv
    Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
    ๐Ÿ”— saurabh-kumar.com/python-dotenv
    โญ 6,883

  49. googlecloudplatform/python-docs-samples
    Code samples used on cloud.google.com
    โญ 6,864

  50. google/latexify_py
    A library to generate LaTeX expression from Python code.
    โญ 6,852

  51. marshmallow-code/marshmallow
    A lightweight library for converting complex objects to and from simple Python datatypes.
    ๐Ÿ”— marshmallow.readthedocs.io
    โญ 6,819

  52. hugapi/hug
    Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.
    โญ 6,804

  53. pygithub/PyGithub
    Typed interactions with the GitHub API v3
    ๐Ÿ”— pygithub.readthedocs.io
    โญ 6,558

  54. openai/point-e
    Point cloud diffusion for 3D model synthesis
    โญ 6,187

  55. pyca/cryptography
    cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
    ๐Ÿ”— cryptography.io
    โญ 6,102

  56. sdispater/pendulum
    Python datetimes made easy
    ๐Ÿ”— pendulum.eustace.io
    โญ 5,963

  57. sphinx-doc/sphinx
    The Sphinx documentation generator
    ๐Ÿ”— www.sphinx-doc.org
    โญ 5,940

  58. scikit-image/scikit-image
    Image processing in Python
    ๐Ÿ”— scikit-image.org
    โญ 5,777

  59. wireservice/csvkit
    A suite of utilities for converting to and working with CSV, the king of tabular file formats.
    ๐Ÿ”— csvkit.readthedocs.io
    โญ 5,738

  60. jd/tenacity
    Retrying library for Python
    ๐Ÿ”— tenacity.readthedocs.io
    โญ 5,737

  61. bndr/pipreqs
    pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
    โญ 5,658

  62. icloud-photos-downloader/icloud_photos_downloader
    A command-line tool to download photos from iCloud
    โญ 5,631

  63. agronholm/apscheduler
    Task scheduling library for Python
    โญ 5,561

  64. ijl/orjson
    Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
    โญ 5,341

  65. pytransitions/transitions
    A lightweight, object-oriented finite state machine implementation in Python with many extensions
    โญ 5,252

  66. pdfminer/pdfminer.six
    Community maintained fork of pdfminer - we fathom PDF
    ๐Ÿ”— pdfminersix.readthedocs.io
    โญ 5,188

  67. buildbot/buildbot
    Python-based continuous integration testing framework; your pull requests are more than welcome!
    ๐Ÿ”— www.buildbot.net
    โญ 5,148

  68. prompt-toolkit/ptpython
    A better Python REPL
    โญ 4,987

  69. pycqa/pycodestyle
    Simple Python style checker in one Python file
    ๐Ÿ”— pycodestyle.pycqa.org
    โญ 4,953

  70. rsalmei/alive-progress
    A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
    โญ 4,928

  71. timdettmers/bitsandbytes
    Accessible large language models via k-bit quantization for PyTorch.
    โญ 4,904

  72. spotify/pedalboard
    🎛 🔊 A Python library for working with audio.
    🔗 spotify.github.io/pedalboard
    ⭐ 4,748

  73. jorgebastida/awslogs
    AWS CloudWatch logs for Humans™
    ⭐ 4,725

  74. hhatto/autopep8
    A tool that automatically formats Python code to conform to the PEP 8 style guide.
    ๐Ÿ”— pypi.org/project/autopep8
    โญ 4,485

  75. pytoolz/toolz
    A functional standard library for Python.
    ๐Ÿ”— toolz.readthedocs.org
    โญ 4,464

  76. tebelorg/RPA-Python
    Python package for doing RPA
    โญ 4,392

  77. pyinvoke/invoke
    Pythonic task management & command execution.
    ๐Ÿ”— pyinvoke.org
    โญ 4,199

  78. bogdanp/dramatiq
    A fast and reliable background task processing library for Python 3.
    ๐Ÿ”— dramatiq.io
    โญ 3,945

  79. evhub/coconut
    Simple, elegant, Pythonic functional programming.
    ๐Ÿ”— coconut-lang.org
    โญ 3,907

  80. adafruit/circuitpython
    CircuitPython - a Python implementation for teaching coding with microcontrollers
    ๐Ÿ”— circuitpython.org
    โญ 3,815

  81. miguelgrinberg/python-socketio
    Python Socket.IO server and client
    โญ 3,689

  82. rspeer/python-ftfy
    Fixes mojibake and other glitches in Unicode text, after the fact.
    ๐Ÿ”— ftfy.readthedocs.org
    โญ 3,659

  83. joblib/joblib
    Computing with Python functions.
    ๐Ÿ”— joblib.readthedocs.org
    โญ 3,576

  84. python-markdown/markdown
    A Python implementation of John Gruber’s Markdown with Extension support.
    🔗 python-markdown.github.io
    ⭐ 3,518

  85. zeromq/pyzmq
    PyZMQ: Python bindings for zeromq
    ๐Ÿ”— zguide.zeromq.org/py:all
    โญ 3,517

  86. ets-labs/python-dependency-injector
    Dependency injection framework for Python
    ๐Ÿ”— python-dependency-injector.ets-labs.org
    โญ 3,488

  87. pypi/warehouse
    The Python Package Index
    ๐Ÿ”— pypi.org
    โญ 3,437

  88. ashleve/lightning-hydra-template
    PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
    ⭐ 3,435

  89. tartley/colorama
    Simple cross-platform colored terminal text in Python
    โญ 3,368

  90. more-itertools/more-itertools
    More routines for operating on iterables, beyond itertools
    ๐Ÿ”— more-itertools.rtfd.io
    โญ 3,361

  91. osohq/oso
    Oso is a batteries-included framework for building authorization in your application.
    ๐Ÿ”— docs.osohq.com
    โญ 3,357

  92. pydata/xarray
    N-D labeled arrays and datasets in Python
    ๐Ÿ”— xarray.dev
    โญ 3,348

  93. jorisschellekens/borb
    borb is a library for reading, creating and manipulating PDF files in python.
    ๐Ÿ”— borbpdf.com
    โญ 3,247

  94. suor/funcy
    A collection of fancy and practical functional tools
    ⭐ 3,227

  95. pyo3/maturin
    Build and publish crates with pyo3, rust-cpython and cffi bindings as well as rust binaries as python packages
    ๐Ÿ”— maturin.rs
    โญ 3,032

  96. spotify/basic-pitch
    A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
    ๐Ÿ”— basicpitch.io
    โญ 2,760

  97. tox-dev/pipdeptree
    A command line utility to display dependency tree of the installed Python packages
    ๐Ÿ”— pypi.python.org/pypi/pipdeptree
    โญ 2,631

  98. legrandin/pycryptodome
    A self-contained cryptographic library for Python
    ๐Ÿ”— www.pycryptodome.org
    โญ 2,623

  99. liiight/notifiers
    The easy way to send notifications
    ๐Ÿ”— notifiers.readthedocs.io
    โญ 2,578

  100. camelot-dev/camelot
    A Python library to extract tabular data from PDFs
    ๐Ÿ”— camelot-py.readthedocs.io
    โญ 2,539

  101. lxml/lxml
    The lxml XML toolkit for Python
    ๐Ÿ”— lxml.de
    โญ 2,535

  102. whylabs/whylogs
    An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
    🔗 whylogs.readthedocs.io
    ⭐ 2,496

  103. pyinfra-dev/pyinfra
    pyinfra automates infrastructure using Python. It’s fast and scales from one server to thousands. Great for ad-hoc command execution, service deployment, configuration management and more.
    🔗 pyinfra.com
    ⭐ 2,491

  104. pexpect/pexpect
    A Python module for controlling interactive programs in a pseudo-terminal
    ๐Ÿ”— pexpect.readthedocs.io
    โญ 2,491

  105. pyston/pyston
    A faster and highly-compatible implementation of the Python programming language.
    ๐Ÿ”— www.pyston.org
    โญ 2,481

  106. dosisod/refurb
    A tool for refurbishing and modernizing Python codebases
    โญ 2,436

  107. scrapinghub/dateparser
    python parser for human readable dates
    โญ 2,429

  108. yaml/pyyaml
    Canonical source repository for PyYAML
    โญ 2,387

  109. cdgriffith/Box
    Python dictionaries with advanced dot notation access
    ๐Ÿ”— github.com/cdgriffith/box/wiki
    โญ 2,328

  110. nschloe/tikzplotlib
    📊 Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.
    ⭐ 2,269

  111. pypa/setuptools
    Official project repository for the Setuptools build system
    ๐Ÿ”— pypi.org/project/setuptools
    โญ 2,252

  112. hgrecco/pint
    Operate and manipulate physical quantities in Python
    ๐Ÿ”— pint.readthedocs.org
    โญ 2,217

  113. dateutil/dateutil
    Useful extensions to the standard Python datetime features
    โญ 2,204

  114. ianmiell/shutit
    Automation framework for programmers
    ๐Ÿ”— ianmiell.github.io/shutit
    โญ 2,143

  115. pndurette/gTTS
    Python library and CLI tool to interface with Google Translate's text-to-speech API
    ๐Ÿ”— gtts.readthedocs.org
    โญ 2,104

  116. pyparsing/pyparsing
    Python library for creating PEG parsers
    โญ 2,049

  117. libaudioflux/audioFlux
    A library for audio and music analysis, feature extraction.
    ๐Ÿ”— audioflux.top
    โญ 1,989

  118. grantjenks/python-diskcache
    Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
    ๐Ÿ”— www.grantjenks.com/docs/diskcache
    โญ 1,982

  119. google/gin-config
    Gin provides a lightweight configuration framework for Python
    โญ 1,967

  120. grahamdumpleton/wrapt
    A Python module for decorators, wrappers and monkey patching.
    โญ 1,936

  121. pyfilesystem/pyfilesystem2
    Python's Filesystem abstraction layer
    ๐Ÿ”— www.pyfilesystem.org
    โญ 1,931

  122. astanin/python-tabulate
    Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
    ๐Ÿ”— pypi.org/project/tabulate
    โญ 1,919

  123. landscapeio/prospector
    Inspects Python source files and provides information about type and location of classes, methods etc
    โญ 1,894

  124. nateshmbhat/pyttsx3
    Offline Text To Speech synthesis for python
    โญ 1,841

  125. julienpalard/Pipe
    A Python library to use infix notation in Python
    โญ 1,828

  126. chaostoolkit/chaostoolkit
    Chaos Engineering Toolkit & Orchestration for Developers
    ๐Ÿ”— chaostoolkit.org
    โญ 1,811

  127. python-rope/rope
    a python refactoring library
    โญ 1,802

  128. joowani/binarytree
    Python Library for Studying Binary Trees
    ๐Ÿ”— binarytree.readthedocs.io
    โญ 1,799

  129. carpedm20/emoji
    emoji terminal output for Python
    โญ 1,792

  130. numba/llvmlite
    A lightweight LLVM python binding for writing JIT compilers
    ๐Ÿ”— llvmlite.pydata.org
    โญ 1,777

  131. pydoit/doit
    task management & automation tool
    🔗 pydoit.org
    ⭐ 1,753

  132. mitmproxy/pdoc
    API Documentation for Python Projects
    🔗 pdoc.dev
    ⭐ 1,750

  133. jcrist/msgspec
    A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
    🔗 jcristharif.com/msgspec
    ⭐ 1,744

  134. omry/omegaconf
    Flexible Python configuration system. The last one you will ever need.
    ⭐ 1,732

  135. kalliope-project/kalliope
    Kalliope is a framework that will help you to create your own personal assistant.
    🔗 kalliope-project.github.io
    ⭐ 1,688

  136. konradhalas/dacite
    Simple creation of data classes from dictionaries.
    ⭐ 1,611

  137. home-assistant/supervisor
    🏡 Home Assistant Supervisor
    🔗 home-assistant.io/hassio
    ⭐ 1,600

  138. rhettbull/osxphotos
    Python app to export pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata.
    ⭐ 1,582

  139. rubik/radon
    Various code metrics for Python code
    🔗 radon.readthedocs.org
    ⭐ 1,578

  140. samuelcolvin/watchfiles
    Simple, modern and fast file watching and code reload in python.
    🔗 watchfiles.helpmanual.io
    ⭐ 1,523

  141. open-telemetry/opentelemetry-python
    OpenTelemetry Python API and SDK
    🔗 opentelemetry.io
    ⭐ 1,519

  142. pygments/pygments
    Pygments is a generic syntax highlighter written in Python
    🔗 pygments.org
    ⭐ 1,515

  143. mkdocstrings/mkdocstrings
    📘 Automatic documentation from sources, for MkDocs.
    🔗 mkdocstrings.github.io
    ⭐ 1,505

  144. facebookincubator/Bowler
    Safe code refactoring for modern Python.
    🔗 pybowler.io
    ⭐ 1,505

  145. p0dalirius/Coercer
    A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
    🔗 podalirius.net
    ⭐ 1,495

  146. nficano/python-lambda
    A toolkit for developing and deploying serverless Python code in AWS Lambda.
    ⭐ 1,467

  147. hbldh/bleak
    A cross platform Bluetooth Low Energy Client for Python using asyncio
    ⭐ 1,443

  148. quodlibet/mutagen
    Python module for handling audio metadata
    🔗 mutagen.readthedocs.io
    ⭐ 1,411

  149. instagram/LibCST
    A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
    🔗 libcst.readthedocs.io
    ⭐ 1,375

  150. fabiocaccamo/python-benedict
    📘 dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.
    ⭐ 1,366

  151. aws-samples/aws-glue-samples
    AWS Glue code samples
    ⭐ 1,366

  152. pycqa/pyflakes
    A simple program which checks Python source files for errors
    🔗 pypi.org/project/pyflakes
    ⭐ 1,326

  153. lidatong/dataclasses-json
    Easily serialize Data Classes to and from JSON
    ⭐ 1,268

  154. ossf/criticality_score
    Gives criticality score for an open source project
    ⭐ 1,264

  155. brandon-rhodes/python-patterns
    Source code behind the python-patterns.guide site by Brandon Rhodes
    ⭐ 1,234

  156. lcompilers/lpython
    Python compiler
    🔗 lpython.org
    ⭐ 1,205

  157. aio-libs/yarl
    Yet another URL library
    🔗 yarl.aio-libs.org
    ⭐ 1,159

  158. oracle/graalpython
    A Python 3 implementation built on GraalVM
    ⭐ 1,075

  159. pdoc3/pdoc
    🐍 ➡️ 📜 Auto-generate API documentation for Python projects
    🔗 pdoc3.github.io/pdoc
    ⭐ 1,060

  160. c4urself/bump2version
    Version-bump your software with a single command
    🔗 pypi.python.org/pypi/bump2version
    ⭐ 1,030

  161. metachris/logzero
    Robust and effective logging for Python 2 and 3.
    🔗 logzero.readthedocs.io
    ⭐ 1,030

  162. pyo3/rust-numpy
    PyO3-based Rust bindings of the NumPy C-API
    ⭐ 964

  163. fastai/fastcore
    Python supercharged for the fastai library
    🔗 fastcore.fast.ai
    ⭐ 894

  164. alex-sherman/unsync
    Unsynchronize asyncio
    ⭐ 863

  165. milvus-io/pymilvus
    Python SDK for Milvus.
    ⭐ 808

  166. samuelcolvin/dirty-equals
    Doing dirty (but extremely useful) things with equals.
    🔗 dirty-equals.helpmanual.io
    ⭐ 748

  167. barracuda-fsh/pyobd
    open source obd2 car diagnostics program - reuploaded
    ⭐ 695

  168. lastmile-ai/aiconfig
    AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
    🔗 aiconfig.lastmileai.dev
    ⭐ 689

  169. instagram/Fixit
    Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
    🔗 fixit.rtfd.io/en/latest
    ⭐ 639

  170. pypa/build
    A simple, correct Python build frontend
    🔗 build.pypa.io
    ⭐ 631

  171. pypy/pypy
    PyPy is a very fast and compliant implementation of the Python language.
    🔗 pypy.org
    ⭐ 624

  172. gefyrahq/gefyra
    Blazingly-fast 🚀, rock-solid, local application development ➡️ with Kubernetes.
    🔗 gefyra.dev
    ⭐ 606

  173. anthropics/anthropic-sdk-python
    SDK providing access to Anthropic's safety-first language model APIs
    ⭐ 598

  174. open-telemetry/opentelemetry-python-contrib
    OpenTelemetry instrumentation for Python modules
    🔗 opentelemetry.io
    ⭐ 573

  175. methexis-inc/terminal-copilot
    A smart terminal assistant that helps you find the right command.
    ⭐ 553

  176. qdrant/qdrant-client
    Python client for Qdrant vector search engine
    🔗 qdrant.tech
    ⭐ 530

  177. steamship-core/steamship-langchain
    steamship-langchain
    ⭐ 502

  178. fastai/ghapi
    A delightful and complete interface to GitHub's amazing API
    🔗 ghapi.fast.ai
    ⭐ 500

Visualisation

Visualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL.

  1. apache/superset
    Apache Superset is a Data Visualization and Data Exploration Platform
    🔗 superset.apache.org
    ⭐ 56,677

  2. streamlit/streamlit
    Streamlit - A faster way to build and share data apps.
    🔗 streamlit.io
    ⭐ 30,253

  3. gradio-app/gradio
    Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
    🔗 www.gradio.app
    ⭐ 26,394

  4. plotly/dash
    Data Apps & Dashboards for Python. No JavaScript Required.
    🔗 plotly.com/dash
    ⭐ 20,145

  5. matplotlib/matplotlib
    matplotlib: plotting with Python
    🔗 matplotlib.org/stable
    ⭐ 18,887

  6. bokeh/bokeh
    Interactive Data Visualization in the browser, from Python
    🔗 bokeh.org
    ⭐ 18,594

  7. plotly/plotly.py
    The interactive graphing library for Python ✨ This project now includes Plotly Express!
    🔗 plotly.com/python
    ⭐ 14,880

  8. mwaskom/seaborn
    Statistical data visualization in Python
    🔗 seaborn.pydata.org
    ⭐ 11,703

  9. visgl/deck.gl
    WebGL2 powered visualization framework
    🔗 deck.gl
    ⭐ 11,538

  10. marceloprates/prettymaps
    A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
    ⭐ 10,718

  11. altair-viz/altair
    Declarative statistical visualization library for Python
    🔗 altair-viz.github.io
    ⭐ 8,718

  12. nvidia/TensorRT-LLM
    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
    🔗 nvidia.github.io/tensorrt-llm
    ⭐ 5,291

  13. lux-org/lux
    Automatically visualize your pandas dataframe via a single print! 📊 💡
    ⭐ 4,872

  14. man-group/dtale
    Visualizer for pandas data structures
    🔗 alphatechadmin.pythonanywhere.com
    ⭐ 4,440

  15. renpy/renpy
    The Ren'Py Visual Novel Engine
    🔗 www.renpy.org
    ⭐ 4,416

  16. holoviz/panel
    Panel: The powerful data exploration & web app framework for Python
    🔗 panel.holoviz.org
    ⭐ 3,853

  17. has2k1/plotnine
    A Grammar of Graphics for Python
    🔗 plotnine.org
    ⭐ 3,744

  18. pyqtgraph/pyqtgraph
    Fast data visualization and GUI tools for scientific / engineering applications
    🔗 www.pyqtgraph.org
    ⭐ 3,595

  19. vispy/vispy
    Main repository for Vispy
    🔗 vispy.org
    ⭐ 3,185

  20. ml-tooling/opyrator
    🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.
    🔗 opyrator-playground.mltooling.org
    ⭐ 3,002

  21. netflix/flamescope
    FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.
    ⭐ 2,963

  22. facebookresearch/hiplot
    HiPlot makes understanding high dimensional data easy
    🔗 facebookresearch.github.io/hiplot
    ⭐ 2,658

  23. holoviz/holoviews
    With Holoviews, your data visualizes itself.
    🔗 holoviews.org
    ⭐ 2,570

  24. mckinsey/vizro
    Vizro is a toolkit for creating modular data visualization applications.
    🔗 vizro.readthedocs.io/en/stable
    ⭐ 2,268

  25. pyvista/pyvista
    3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
    🔗 docs.pyvista.org
    ⭐ 2,202

  26. marcomusy/vedo
    A python module for scientific analysis of 3D data based on VTK and Numpy
    🔗 vedo.embl.es
    ⭐ 1,869

  27. datapane/datapane
    Build and share data reports in 100% Python
    🔗 datapane.com
    ⭐ 1,343

  28. facultyai/dash-bootstrap-components
    Bootstrap components for Plotly Dash
    🔗 dash-bootstrap-components.opensource.faculty.ai
    ⭐ 1,042

  29. nomic-ai/deepscatter
    Zoomable, animated scatterplots in the browser that scale to over a billion points
    ⭐ 946

  30. hazyresearch/meerkat
    Creative interactive views of any dataset.
    ⭐ 792

Web

Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.

  1. django/django
    The Web framework for perfectionists with deadlines.
    🔗 www.djangoproject.com
    ⭐ 75,661

  2. tiangolo/fastapi
    FastAPI framework, high performance, easy to learn, fast to code, ready for production
    🔗 fastapi.tiangolo.com
    ⭐ 68,368

  3. pallets/flask
    The Python micro framework for building web applications.
    🔗 flask.palletsprojects.com
    ⭐ 65,778

  4. psf/requests
    A simple, yet elegant, HTTP library.
    🔗 requests.readthedocs.io/en/latest
    ⭐ 51,050

  5. sherlock-project/sherlock
    🔎 Hunt down social media accounts by username across social networks
    🔗 sherlock-project.github.io
    ⭐ 47,770

  6. tornadoweb/tornado
    Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
    🔗 www.tornadoweb.org
    ⭐ 21,437

  7. huge-success/sanic
    Accelerate your web app development | Build fast. Run fast.
    🔗 sanic.dev
    ⭐ 17,612

  8. pyscript/pyscript
    A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
    🔗 pyscript.net
    ⭐ 17,345

  9. wagtail/wagtail
    A Django content management system focused on flexibility and user experience
    🔗 wagtail.org
    ⭐ 16,787

  10. reflex-dev/reflex
    🕸 Web apps in pure Python 🐍
    🔗 reflex.dev
    ⭐ 14,960

  11. aio-libs/aiohttp
    Asynchronous HTTP client/server framework for asyncio and Python
    🔗 docs.aiohttp.org
    ⭐ 14,376

  12. getpelican/pelican
    Static site generator that supports Markdown and reST syntax. Powered by Python.
    🔗 getpelican.com
    ⭐ 12,022

  13. encode/httpx
    A next generation HTTP client for Python. 🦋
    🔗 www.python-httpx.org
    ⭐ 11,886

  14. aws/chalice
    Python Serverless Microframework for AWS
    ⭐ 10,209

  15. benoitc/gunicorn
    gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
    🔗 www.gunicorn.org
    ⭐ 9,375

  16. falconry/falcon
    The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.
    🔗 falcon.readthedocs.io/en/stable
    ⭐ 9,340

  17. encode/starlette
    The little ASGI framework that shines. 🌟
    🔗 www.starlette.io
    ⭐ 9,191

  18. flet-dev/flet
    Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
    🔗 flet.dev
    ⭐ 8,387

  19. bottlepy/bottle
    bottle.py is a fast and simple micro-framework for python web-applications.
    🔗 bottlepy.org
    ⭐ 8,236

  20. graphql-python/graphene
    GraphQL framework for Python
    🔗 graphene-python.org
    ⭐ 7,920

  21. encode/uvicorn
    An ASGI web server, for Python. 🦄
    🔗 www.uvicorn.org
    ⭐ 7,561

  22. reactive-python/reactpy
    ReactPy is a library for building user interfaces in Python without Javascript
    🔗 reactpy.dev
    ⭐ 7,522

  23. zauberzeug/nicegui
    Create web-based user interfaces with Python. The nice way.
    🔗 nicegui.io
    ⭐ 6,660

  24. pyeve/eve
    REST API framework designed for human beings
    🔗 python-eve.org
    ⭐ 6,652

  25. pallets/werkzeug
    The comprehensive WSGI web application library.
    🔗 werkzeug.palletsprojects.com
    ⭐ 6,509

  26. vitalik/django-ninja
    💨 Fast, Async-ready, Openapi, type hints based framework for building APIs
    🔗 django-ninja.dev
    ⭐ 5,895

  27. webpy/webpy
    web.py is a web framework for python that is as simple as it is powerful.
    🔗 webpy.org
    ⭐ 5,862

  28. stephenmcd/mezzanine
    CMS framework for Django
    🔗 mezzanine.jupo.org
    ⭐ 4,695

  29. pywebio/PyWebIO
    Write interactive web apps in a script-like way.
    🔗 pywebio.readthedocs.io
    ⭐ 4,263

  30. starlite-api/litestar
    Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
    🔗 litestar.dev
    ⭐ 3,973

  31. pylons/pyramid
    Pyramid - A Python web framework
    🔗 trypyramid.com
    ⭐ 3,883

  32. fastapi-users/fastapi-users
    Ready-to-use and customizable users management for FastAPI
    🔗 fastapi-users.github.io/fastapi-users
    ⭐ 3,879

  33. strawberry-graphql/strawberry
    A GraphQL library for Python that leverages type annotations
    🔗 strawberry.rocks
    ⭐ 3,670

  34. websocket-client/websocket-client
    WebSocket client for Python
    🔗 github.com/websocket-client/websocket-client
    ⭐ 3,410

  35. unbit/uwsgi
    uWSGI application server container
    🔗 projects.unbit.it/uwsgi
    ⭐ 3,381

  36. pallets/quart
    An async Python micro framework for building web applications.
    🔗 quart.palletsprojects.com
    ⭐ 2,493

  37. flipkart-incubator/Astra
    Automated Security Testing For REST APIs
    ⭐ 2,398

  38. fastapi-admin/fastapi-admin
    A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
    🔗 fastapi-admin-docs.long2ice.io
    ⭐ 2,392

  39. masoniteframework/masonite
    The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
    🔗 docs.masoniteproject.com
    ⭐ 2,122

  40. python-restx/flask-restx
    Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
    🔗 flask-restx.readthedocs.io/en/latest
    ⭐ 2,042

  41. dot-agent/nextpy
    ⚡Blazing Fast Pythonic App Framework from Future 🔮: For Humans & LLMs 🤖
    🔗 nextpy.org
    ⭐ 1,963

  42. cherrypy/cherrypy
    CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
    🔗 docs.cherrypy.dev
    ⭐ 1,762

  43. neoteroi/BlackSheep
    Fast ASGI web framework for Python
    🔗 www.neoteroi.dev/blacksheep
    ⭐ 1,641

  44. jordaneremieff/mangum
    AWS Lambda support for ASGI applications
    🔗 mangum.io
    ⭐ 1,546

  45. s3rius/FastAPI-template
    Feature rich robust FastAPI template.
    ⭐ 1,510

  46. wtforms/wtforms
    A flexible forms validation and rendering library for Python.
    🔗 wtforms.readthedocs.io
    ⭐ 1,450

  47. awtkns/fastapi-crudrouter
    A dynamic FastAPI router that automatically creates CRUD routes for your models
    🔗 fastapi-crudrouter.awtkns.com
    ⭐ 1,275

  48. magicstack/httptools
    Fast HTTP parser
    ⭐ 1,157

  49. long2ice/fastapi-cache
    fastapi-cache is a tool to cache FastAPI responses and function results, with backend support for Redis and Memcached.
    🔗 github.com/long2ice/fastapi-cache
    ⭐ 1,035

  50. koxudaxi/fastapi-code-generator
    This code generator creates FastAPI app from an openapi file.
    ⭐ 879

  51. rstudio/py-shiny
    Shiny for Python
    🔗 shiny.posit.co/py
    ⭐ 864

  52. aeternalis-ingenium/FastAPI-Backend-Template
    A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.
    ⭐ 520


Interactive version: www.awesomepython.org, Hugging Face Dataset: awesome-python

Please raise a new issue to suggest a Python repo that you would like to see added.

1,354 hand-picked awesome Python libraries and frameworks, updated 27 Feb 2024
