A common error you may encounter when using Python is modulenotfounderror: no module named ‘sklearn’. This error occurs when Python cannot detect the Scikit-learn library in your current environment, and Scikit-learn does not come with the default Python installation. This tutorial goes through the exact steps to troubleshoot this error for the Windows, Mac and Linux operating systems.
Table of contents
- ModuleNotFoundError: no module named ‘sklearn’
- What is Scikit-learn?
- How to install Scikit-learn on Windows Operating System
- How to install Scikit-learn on Mac Operating System
- How to install Scikit-learn on Linux Operating System
- Check Scikit-Learn Version
- Installing Scikit-Learn Using Anaconda
- Prerequisites Before Using Scikit-Learn
ModuleNotFoundError: no module named ‘sklearn’
What is ModuleNotFoundError?
The ModuleNotFoundError occurs when the module you want to use is not present in your Python environment. There are several causes of the modulenotfounderror:
The module’s name is incorrect, in which case you have to check the name of the module you tried to import. Let’s try to import the re module with a double e to see what happens:
--------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) 1 import ree ModuleNotFoundError: No module named 'ree'
To solve this error, ensure the module name is correct. Let’s look at the revised code:
import re print(re.__version__)
You may want to import a local module file, but the module is not in the same directory. Let’s look at an example package with a script and a local module to import. Let’s look at the following steps to perform from your terminal:
mkdir example_package cd example_package mkdir folder_1 cd folder_1 vi module.py
Note that we use Vim to create the module.py file in this example. You can use your preferred file editor, such as Emacs or Atom. In module.py, we will import the re module and define a simple function that prints the re version:
import re def print_re_version(): print(re.__version__)
Close the module.py, then complete the following commands from your terminal:
cd ../ vi script.py
Inside script.py, we will try to import the module we created.
import module if __name__ == '__main__': mod.print_re_version()
Let’s run python script.py from the terminal to see what happens:
ModuleNotFoundError: No module named 'module'
To solve this error, we need to point to the correct path to module.py, which is inside folder_1. Let’s look at the revised code:
import folder_1.module as mod if __name__ == '__main__': mod.print_re_version()
When we run python script.py, we will get the following result:
Lastly, you can encounter the modulenotfounderror when you import a module that is not installed in your Python environment.
What is Scikit-learn?
Scikit-learn is a Python module for machine learning. The library is mainly written in Python and is built on NumPy, SciPy, and Matplotlib. The simplest way to install Scikit-learn is to use the package manager for Python called pip. The following instructions to install Scikit-learn are for the major Python version 3.
How to install Scikit-learn on Windows Operating System
You need to download and install Python on your PC. Ensure you select the install launcher for all users and Add Python to PATH checkboxes. The latter ensures the interpreter is in the execution path. Pip is automatically on Windows for Python versions 2.7.9+ and 3.4+.
You can install pip on Windows by downloading the installation package, opening the command line and launching the installer. You can install pip via the CMD prompt by running the following command.
You may need to run the command prompt as administrator. Check whether the installation has been successful by typing.
To install Scikit-learn with pip, run the following command from the command prompt.
pip install -U scikit-learn
How to install Scikit-learn on Mac Operating System
Open a terminal by pressing command (⌘) + Space Bar to open the Spotlight search. Type in terminal and press enter. To get pip, first ensure you have installed Python3.
You can install Python3 by using the Homebrew package manager:
/usr/bin/ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)" export PATH="/usr/local/opt/python/libexec/bin:$PATH" # if you are on macOS 10.12 (Sierra) use `export PATH="/usr/local/bin:/usr/local/sbin:$PATH"` brew update brew install python # Python 3
Download pip by running the following curl command:
curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
The curl command allows you to specify a direct download link, and using the -o option sets the name of the downloaded file.
Install pip by running:
From the terminal, use pip3 to install Scikit-learn:
pip install -U scikit-learn
How to install Scikit-learn on Linux Operating System
All major Linux distributions have Python installed by default. However, you will need to install pip. You can install pip from the terminal, but the installation instructions depend on the Linux distribution you are using. You will need root privileges to install pip. Open a terminal and use the commands relevant to your Linux distribution to install pip.
Installing pip for Ubuntu, Debian, and Linux Mint
sudo apt install python-pip3
Installing pip for CentOS 8 (and newer), Fedora, and Red Hat
sudo dnf install python-pip3
Installing pip for CentOS 6 and 7, and older versions of Red Hat
sudo yum install epel-release sudo yum install python-pip3
Installing pip for Arch Linux and Manjaro
sudo pacman -S python-pip
Installing pip for OpenSUSE
sudo zypper python3-pip
Once you have installed pip, you can install Scikit-learn using:
pip install -U scikit-learn
Check Scikit-Learn Version
Once you have successfully installed Scikit-learn, you can use two methods to check the version of Scikit-learn. First, you can use pip show from your terminal.
pip show scikit-learn
Name: scikit-learn Version: 0.24.1 Summary: A set of python modules for machine learning and data mining Home-page: http://scikit-learn.org Author: None Author-email: None License: new BSD Location: /Users/Yusufu.Shehu/opt/anaconda3/lib/python3.8/site-packages Requires: threadpoolctl, numpy, scipy, joblib Required-by: mlxtend, imbalanced-learn
Second, within your python program, you can import Scikit-Learn and then reference the __version__ attribute:
import sklearn print(sklearn.__version__)
Installing Scikit-Learn Using Anaconda
Anaconda is a distribution of Python and R for scientific computing and data science. You can install Anaconda by going to the installation instructions. Once you have installed Anaconda, you can install Scikit-learn using the following command:
conda install -c conda-forge scikit-learn
Prerequisites Before Using Scikit-Learn
Before you can start using the latest release of scikit-learn, you must have the following installed:
- Python (>= 3.5)
- NumPy (>= 1.11.0)
- SciPy (>= 0.17.0)
- Joblib (>= 0.11)
- Matplotlib (>= 1.5.1) required for Scikit-Learn plotting capabilities
- Pandas (>= 0.18.0) is required for Scikit-learn data structure and analysis
Congratulations on reading to the end of this tutorial. The modulenotfounderror occurs if you misspell the module name, incorrectly point to the module path or do not have the module installed in your Python environment. If you do not have the module installed in your Python environment, you can use pip to install the package. However, you must ensure you have pip installed on your system. You can also install Anaconda on your system and use the conda install command to install the Scikit-learn library.
You may encounter a ModuleNotFoundError when trying to use a class or function from a module in Scikit-Learn. To solve this error, go to the article: How to Solve ModuleNotFoundError: No module named ‘sklearn.cross_validation’.
For further reading on installing data science and machine learning libraries, you can go to the articles:
- OpenCV: How to Solve Python ModuleNotFoundError: no module named ‘cv2’
- Requests: How to Solve Python ModuleNotFoundError: no module named ‘requests’
- Pandas: How to Solve Python ModuleNotFoundError: no module named ‘pandas’
- Matplotlib: How to Solve Python ModuleNotFoundError: no module named ‘matplotlib’
- Numpy: How to Solve Python ModuleNotFoundError: no module named ‘numpy’
- Imbalanced-learn: How to Solve Python ModuleNotFoundError: no module named ‘imblearn’
Go to the online courses page on Python to learn more about Python for data science and machine learning.
Have fun and happy researching!