
Wraz ze wzrostem popularności rozwiązań Data Science i Sztucznej Inteligencji, coraz więcej ludzi poszukuje odpowiednich bibliotek i narzędzi pozwalających realizować takie projekty.
Postanowiłem podzielić się swoją listą narzędzi i bibliotek które można użyć w tego typu projektach. Jeśli ktoś woli format arkusza, lista jest dostępna także tutaj. Jeśli uważasz, że warto dodać do tej listy jakieś narzędzie, proszę o informację na maila lub komentarz pod wpisem lub w arkuszu.
Nazwa | Strona | Kod żródłowy | Komentarz | Opis |
---|---|---|---|---|
Apache FlinkML | link | link | Szybko rosnąca popularność | Machine Learning (ML) library for Flink |
Apache Hivemall | link | link | Inkubacja | Hivemall is a scalable machine learning library that runs on Apache Hive, Spark and Pig. |
Apache Mahout | link | link | Free implementations of distributed or otherwise scalable machine learning algorithms | |
Apache MXNet | link | link | Inkubacja, szybko rosnąca popularność | |
Apache OpenNLP | link | Bardzo popularna | The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text | |
Apache SINGA | link | link | ||
Apache Spark -> Databricks Deep Learning Pipelines for Apache Spark | link | |||
Apache Spark -> MLlib | link | link | Bardzo popularna | MLlib is Apache Spark’s scalable machine learning library. |
Caffe | link | link | ||
Caffe2 | link | link | Bardzo popularna | Convolutional Architecture for Fast Feature Embedding. A New Lightweight, Modular, and Scalable Deep Learning Framework |
CatBoost | link | link | ||
Chainer | link | link | Chainer is an open source deep learning framework written purely in Python on top of Numpy and CuPy Python libraries | |
DeepLearning4J | link | link | Deep learning programming library written for Java and the Java virtual machine (JVM) | |
Elephas | link | Distributed Deep Learning with Keras & Spark | ||
Gluon | link | A clear, concise, simple yet powerful and efficient API for deep learning. | ||
H2O | link | link | Bardzo popularna | Data Science and Machine Learning Platform |
Horovod | link | link | Distributed training framework for TensorFlow, Keras, PyTorch, and MXNet. | |
Intel Analytics Zoo | link | link | A unified analytics + AI platform for distributed TensorFlow, Keras and BigDL on Apache Spark | |
Intel BigDL | link | link | BigDL is a distributed deep learning library for Apache Spark | |
Intel Data Analytics Acceleration Library (Intel DAAL) | ||||
Intel Math Kernel Library (Intel MKL) | ||||
Keras | link | link | Bardzo popularna | Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. |
Keras -> Cern Distributed Keras | link | Nie rozwijana | ||
MATLAB | link | Laboratorium Macierzy (MatrixLaboratory) | ||
Microsoft Cognitive Toolkit (CNTK) | ||||
Natural Language Toolkit (NLTK) | link | zestaw bibliotek i programów do symbolicznego i statystycznego przetwarzania języka naturalnego | ||
Neural Designer | ||||
ONNX (Open Neural Network Exchange) | ||||
OpenNN | link | OpenNN is a free neural networks library for advanced analytics. | ||
PyTorch | link | link | Bardzo popularna | An open source deep learning platform that provides a seamless path from research prototyping to production deployment. |
PyTorch -> AllenNLP | link | |||
R Project | link | Popularny, traci na rzecz Pythona i innych języków | R is a free software environment for statistical computing and graphics | |
Scikit-learn | link | Machine Learning in Python | ||
SciPy | link | Python library used for scientific computing and technical computing | ||
SciPy -> IPython | link | A powerful interactive shell. A kernel for Jupyter. | ||
SciPy -> Matplotlib | link | biblioteka do tworzenia wykresów dla języka programowania Python i jego rozszerzenia numerycznego NumPy | ||
SciPy -> NumPy | link | NumPy is the fundamental package for scientific computing with Python | ||
SciPy -> Pandas | link | library written for the Python programming language for data manipulation and analysis | ||
SciPy -> Sympy | link | SymPy is a Python library for symbolic mathematics. | ||
SciPy library | link | The SciPy library is one of the core packages that make up the SciPy stack | ||
TensorFlow | link | link | Bardzo popularna | |
TensorFlow -> Yahoo TensorFlowOnSpark | link | |||
Theano | link | link | Nie rozwijana | Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently |
Torch | link | link | Nie rozwijana | A scientific computing framework for LuaJIT |
XGBoost | link | link | XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. |