Appendix: Python Packages
Contents
Appendix: Python Packages¶
The following is general overview of packages available in Python that may be useful for data science.
For a much broader / fuller list of the Python ecosystem, check out the
Awesome Python list.
Data-Science Modules¶
These are all external (non-standard library) packages. Many of them are available in the Ananconda distribution.
Core Packages¶
scipy - mathematics, science, and engineering.
numpy - numerical computing with arrays & array operations.
pandas - data structures and data analysis.
scikit-learn - machine learning and data analysis.
Mathematics & Statistics¶
sympy - symbolic mathematics.
statsmodels - statistical modelling.
Plotting / Vizualization Libraries¶
matplotlib - 2D plotting library.
seaborn - visualization (based on matplotlib).
bokeh - interactive visualizations.
Graph Theory / Networks¶
networkx - network analysis.
graph-tool - manipulation and analysis of graphs.
Deep Learning¶
theano - mathematical operations on multi-dimensional arrays.
tensorflow - numerical computation using data flow graphs.
keras - a high-level neural network library.