Sunbelt Computer Software

🛠 PyCaret 4.0 development has started — here's what's happening

If you're here today, you're most likely on the master branch (PyCaret 3.4.0) — still installable via pip install pycaret, still works for existing 3.x code. But development has shifted: the 4.0 revamp is live on the v4 branch and we want to be transparent about what's going on.

Why. PyCaret was released in 2020, widely adopted, and then went roughly three years without active maintenance. The consequences are real: it doesn't install cleanly on Python 3.12+, doesn't work with modern scikit-learn (≥1.5) or NumPy 2, and has 300+ open issues — many from people hitting exactly these compatibility cliffs. We owe you a maintained library.

What. PyCaret 4.0 is a ground-up rebuild designed around how people actually use ML libraries in 2026:

Sklearn-composable OOP engine. ClassificationExperiment, RegressionExperiment, etc. are proper sklearn.base.BaseEstimator subclasses. get_params, clone, __sklearn_tags__ all work.

Lean. Core dependencies cut from 30 → 19. Unmaintained integrations removed: mlflow, comet, wandb, dagshub, fugue, dask, ray, yellowbrick, gradio, fastapi, boto3, m2cgen, evidently, fairlearn. Full list at docs/revamp/KILL_LIST.md.

Modern stack. Works on Python 3.11 / 3.12 / 3.13, scikit-learn 1.7, NumPy 2, pandas 2.x. uv for environment management.

Agent- and UI-native. Typed dataclass returns from every verb (CompareResult, TuneResult, ...), a structured event stream (pycaret.logging), and JSON-serializable introspection (pycaret.api). The forthcoming open-source PyCaret React UI runs on this engine.

Current status (updated regularly): ~22K lines of tech debt removed. 32/32 tests green on the new OOP surface across Python 3.11 / 3.12 / 3.13 × Ubuntu + Windows. Five canonical notebooks executing end-to-end. The 3.x god-class is being drained verb-by-verb; the public API is stable from here on.

Timeline. No firm date — 4.0 ships when it's ready. First installable release will be 4.0.0alpha0 once the first three verbs are fully migrated off the legacy internals. Tracking in docs/revamp/ROADMAP.md and docs/revamp/STATUS.md.

3.x will keep working — no forced migration, no 3.x EOL date. If you don't want to move, don't. We're not going to break your existing code.

Try 4.0 today:
git clone -b v4 https://github.com/pycaret/pycaret.git
cd pycaret
uv sync --all-extras
uv run pytest tests/                     # -> 32 passed
uv run jupyter notebook notebooks/01_classification.ipynb
How this is being built. PyCaret 4.0 is being built collaboratively with Claude (Anthropic's coding agent). Every non-trivial change, every architectural decision, and every trade-off is documented in docs/revamp/release_notes_pycaret4.md — commit-by-commit, session-by-session. The goal is both a better library and a reproducible case study of AI-assisted open-source revival.

Useful links on v4:

README.md — 4.0 quickstart

AGENTS.md — instructions for AI contributors

docs/revamp/ARCHITECTURE.md — the design

docs/revamp/KILL_LIST.md — what was removed and why

docs/revamp/STATUS.md — current session status, refreshed every session

CONTRIBUTING.md — contributor guide for 4.0

Issues. If you're hitting a compat cliff on 3.x (Python 3.12+, sklearn 1.5+, NumPy 2, pandas 2.2+), please don't file new issues — those are already fixed in 4.0. Try the v4 branch instead.

For everything else: 3.x README continues below.

An open-source, low-code machine learning library in Python

🎉🎉🎉 PyCaret 3.4 is now available. 🎉🎉🎉

`pip install --upgrade pycaret`

Docs • Tutorials • Blog • LinkedIn • YouTube • Slack

Overview
CI/CD
Code
Downloads
License
Community

Welcome to PyCaret

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, Optuna, Hyperopt, Ray, and few more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise. PyCaret was inspired by the caret library in R programming language.

🚀 Installation

🌐 Option 1: Install via PyPi

PyCaret is tested and supported on 64-bit systems with:

Python 3.9, 3.10, 3.11 and 3.12
Ubuntu 16.04 or later
Windows 7 or later

You can install PyCaret with Python's pip package manager:

# install pycaret
pip install pycaret

PyCaret's default installation will not install all the optional dependencies automatically. Depending on the use case, you may be interested in one or more extras:

# install analysis extras
pip install pycaret[analysis]

# models extras
pip install pycaret[models]

# install tuner extras
pip install pycaret[tuner]

# install mlops extras
pip install pycaret[mlops]

# install parallel extras
pip install pycaret[parallel]

# install test extras
pip install pycaret[test]

# install dev extras
pip install pycaret[dev]

##

# install multiple extras together
pip install pycaret[analysis,models]

Check out all optional dependencies. If you want to install everything including all the optional dependencies:

# install full version
pip install pycaret[full]

📄 Option 2: Build from Source

Install the development version of the library directly from the source. The API may be unstable. It is not recommended for production use.

pip install git+https://github.com/pycaret/pycaret.git@master --upgrade

📦 Option 3: Docker

Docker creates virtual environments with containers that keep a PyCaret installation separate from the rest of the system. PyCaret docker comes pre-installed with a Jupyter notebook. It can share resources with its host machine (access directories, use the GPU, connect to the Internet, etc.). The PyCaret Docker images are always tested for the latest major releases.

# default version
docker run -p 8888:8888 pycaret/slim

# full version
docker run -p 8888:8888 pycaret/full

🏃‍♂️ Quickstart

1. Functional API

# Classification Functional API Example

# loading sample dataset
from pycaret.datasets import get_data
data = get_data('juice')

# init setup
from pycaret.classification import *
s = setup(data, target = 'Purchase', session_id = 123)

# model training and selection
best = compare_models()

# evaluate trained model
evaluate_model(best)

# predict on hold-out/test set
pred_holdout = predict_model(best)

# predict on new data
new_data = data.copy().drop('Purchase', axis = 1)
predictions = predict_model(best, data = new_data)

# save model
save_model(best, 'best_pipeline')

2. OOP API

# Classification OOP API Example

# loading sample dataset
from pycaret.datasets import get_data
data = get_data('juice')

# init setup
from pycaret.classification import ClassificationExperiment
s = ClassificationExperiment()
s.setup(data, target = 'Purchase', session_id = 123)

# model training and selection
best = s.compare_models()

# evaluate trained model
s.evaluate_model(best)

# predict on hold-out/test set
pred_holdout = s.predict_model(best)

# predict on new data
new_data = data.copy().drop('Purchase', axis = 1)
predictions = s.predict_model(best, data = new_data)

# save model
s.save_model(best, 'best_pipeline')

📁 Modules

Classification

Functional API	OOP API

Regression

Functional API	OOP API

Time Series

Functional API	OOP API

Clustering

Functional API	OOP API

Anomaly Detection

Functional API	OOP API

👥 Who should use PyCaret?

PyCaret is an open source library that anybody can use. In our view the ideal target audience of PyCaret is:

Experienced Data Scientists who want to increase productivity.
Citizen Data Scientists who prefer a low code machine learning solution.
Data Science Professionals who want to build rapid prototypes.
Data Science and Machine Learning students and enthusiasts.

🎮 Training on GPUs

To train models on the GPU, simply pass use_gpu = True in the setup function. There is no change in the use of the API; however, in some cases, additional libraries have to be installed. The following models can be trained on GPUs:

Extreme Gradient Boosting
CatBoost
Light Gradient Boosting Machine requires GPU installation
Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression requires cuML >= 0.15

🖥️ PyCaret Intel sklearnex support

You can apply Intel optimizations for machine learning algorithms and speed up your workflow. To train models with Intel optimizations use sklearnex engine. There is no change in the use of the API, however, installation of Intel sklearnex is required:

pip install scikit-learn-intelex

🤝 Contributors

📝 License

PyCaret is completely free and open-source and licensed under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 5,373 Commits
.github		.github
Docker_files		Docker_files
datasets		datasets
docs		docs
pycaret		pycaret
tests		tests
tutorials		tutorials
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
.slugignore		.slugignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Important Links	Description
⭐ Tutorials	Tutorials developed and maintained by core developers
📋 Example Notebooks	Example notebooks created by community
📙 Blog	Official blog by creator of PyCaret
📚 Documentation	API docs
📺 Videos	Video resources
✈️ Cheat sheet	Community Cheat sheet
📢 Discussions	Community Discussion board on GitHub
🛠️ Release Notes	Release Notes

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛠 PyCaret 4.0 development has started — here's what's happening

An open-source, low-code machine learning library in Python

🎉🎉🎉 PyCaret 3.4 is now available. 🎉🎉🎉

`pip install --upgrade pycaret`

Docs • Tutorials • Blog • LinkedIn • YouTube • Slack

Welcome to PyCaret

🚀 Installation

🌐 Option 1: Install via PyPi

📄 Option 2: Build from Source

📦 Option 3: Docker

🏃‍♂️ Quickstart

1. Functional API

2. OOP API

📁 Modules

Classification

Regression

Time Series

Clustering

Anomaly Detection

👥 Who should use PyCaret?

🎮 Training on GPUs

🖥️ PyCaret Intel sklearnex support

🤝 Contributors

📝 License

ℹ️ More Information

About

Uh oh!

Releases 30

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Folders and files

Latest commit

History

Repository files navigation

🛠 PyCaret 4.0 development has started — here's what's happening

An open-source, low-code machine learning library in Python

🎉🎉🎉 PyCaret 3.4 is now available. 🎉🎉🎉

pip install --upgrade pycaret

Docs • Tutorials • Blog • LinkedIn • YouTube • Slack

Welcome to PyCaret

🚀 Installation

🌐 Option 1: Install via PyPi

📄 Option 2: Build from Source

📦 Option 3: Docker

🏃‍♂️ Quickstart

1. Functional API

2. OOP API

📁 Modules

Classification

Regression

Time Series

Clustering

Anomaly Detection

👥 Who should use PyCaret?

🎮 Training on GPUs

🖥️ PyCaret Intel sklearnex support

🤝 Contributors

📝 License

ℹ️ More Information

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 30

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`pip install --upgrade pycaret`

Packages