
https://www.xenith.co.uk/blog/managed-print-services-5-types-of-analytics-and-what-they-tell-us
https://www.xenith.co.uk/blog/big-data-can-it-help-control-your-printing-costs

https://github.com/donnemartin/data-science-ipython-notebooks#pandas

What is the relationship among NumPy, SciPy, Pandas, and Scikit-learn and when should I use each one of them?

5 Answers

Begtin Ivan, I have used Python for 15+ years. I've created about 10 open data projects with Python
Answered May 7, 2016
Each toolkit has its own purpose:

• NumPy adds support to Python for large, multi-dimensional arrays and matrices, along with a large library of high-level mathematical functions to operate on these arrays.
• SciPy is a collection of mathematical algorithms and convenience functions built on the NumPy extension of Python. It adds significant power to the interactive Python session by providing the user with high-level commands and classes for manipulating and visualizing data.
• Pandas is a software library written for data manipulation and analysis in Python. It offers data structures and operations for manipulating numerical tables and time series.
• Scikit-learn is a Python module for machine learning built on top of SciPy and distributed under the 3-Clause BSD license.
If you want to study machine learning then you need to know all of them.
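
To make the division of labour concrete, here is a minimal sketch that touches each toolkit once. The data (a perfect line `y = 2x + 1`) and all variable names are made up for illustration, and the specific functions chosen (`stats.linregress`, `LinearRegression`) are just one example from each library, not the only way to use it.

```python
import numpy as np
import pandas as pd
from scipy import stats
from sklearn.linear_model import LinearRegression

# NumPy: n-dimensional arrays and vectorized math.
x = np.arange(10, dtype=float)   # 0.0, 1.0, ..., 9.0
y = 2.0 * x + 1.0                # illustrative data on an exact line

# SciPy: numerical algorithms that operate on NumPy arrays.
fit = stats.linregress(x, y)     # least-squares fit; has .slope and .intercept

# pandas: labeled, tabular data with convenient column operations.
df = pd.DataFrame({"x": x, "y": y})
mean_y = df["y"].mean()

# scikit-learn: machine-learning estimators with a fit/predict interface.
model = LinearRegression().fit(df[["x"]], df["y"])
prediction = model.predict(pd.DataFrame({"x": [10.0]}))
```

Roughly: reach for NumPy when you need raw array math, SciPy when you need a ready-made numerical algorithm, pandas when the data has column names or timestamps, and scikit-learn when you want to train and apply a model.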

This image will also be helpful: it shows the dependencies between these toolkits.
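
One way to see the dependency on NumPy in practice is to check what the other three libraries hand back. The sketch below assumes all four packages are installed with default settings; the small table of numbers is invented for the example.

```python
import numpy as np
import pandas as pd
from scipy import linalg
from sklearn.preprocessing import StandardScaler

# A small pandas table (illustrative values only).
df = pd.DataFrame({"a": [1.0, 2.0, 3.0], "b": [4.0, 5.0, 6.0]})

# pandas stores its data in NumPy arrays and can expose them directly.
values = df.to_numpy()

# SciPy routines take NumPy arrays in and return NumPy arrays.
singular_values = linalg.svd(values, compute_uv=False)

# scikit-learn accepts the DataFrame and, by default, returns a NumPy array.
scaled = StandardScaler().fit_transform(df)

print(type(values).__name__, type(singular_values).__name__, type(scaled).__name__)
```

All three results are `numpy.ndarray` objects, which is why NumPy sits at the bottom of the stack.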

References

Data science Python notebooks: donnemartin/data-science-ipython-notebooks

Continuum Analytics Anaconda: http://quasiben.github.io/PyData...
