Thursday, 23 May 2019

Data science tools Part 1

SciPy – 
This is a Python-based ecosystem of open-source software for mathematics, science, and engineering. 
The SciPy ecosystem brings together packages such as NumPy, IPython and Pandas to provide libraries for common math- and science-oriented programming tasks. It is a good option when you want to manipulate numbers on a computer and display or publish the results, and it is free as well.
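
As a rough illustration of the kind of number crunching described above, here is a minimal sketch that integrates a function and finds a minimum. The function `parabola` and the chosen integration limits are made up for this example; it assumes NumPy and SciPy are installed.

```python
import numpy as np
from scipy import integrate, optimize

# Integrate sin(x) from 0 to pi; the exact answer is 2.
area, abs_error = integrate.quad(np.sin, 0.0, np.pi)


# Hypothetical example function: a parabola whose minimum sits at x = 2.
def parabola(x):
    return (x - 2.0) ** 2 + 1.0


# Search for the x that minimizes the parabola.
result = optimize.minimize_scalar(parabola)

print(f"integral of sin on [0, pi]: {area:.6f}")
print(f"parabola minimum found at x = {result.x:.6f}")
```

Both calls return plain Python floats or small result objects, so the numbers can be passed straight to a plotting or reporting library.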

Dask –

 Dask is a tool providing parallelism for analytics by integrating into other community projects like NumPy, Pandas and Scikit-Learn.

With this tool, you can quickly parallelize existing code by changing only a few lines, since its DataFrame mirrors the one in the Pandas library and its Array object works like NumPy's. It can also parallelize jobs written in pure Python.
