Машинное обучение и Большие данные: различия между версиями
Перейти к навигации
Перейти к поиску
Artem (обсуждение | вклад) |
Artem (обсуждение | вклад) |
||
(не показано 109 промежуточных версий этого же участника) | |||
Строка 1: | Строка 1: | ||
[[Файл:Ai-brain.jpg|400px]] | |||
= Software installation = | |||
* Anaconda - https://www.anaconda.com/products/individual | |||
** To remove (base) from PS1 - conda config --set changeps1 false | |||
* sklearn - https://scikit-learn.org/stable/ | |||
* CatBoost - https://catboost.ai/ | |||
* LightGBM - https://lightgbm.readthedocs.io/en/latest/ | |||
* XGBoost - https://xgboost.readthedocs.io/en/stable/ | |||
* Tensorflow - https://www.tensorflow.org/install - [[Tensorflow for old GPUs]] | |||
<math>x^2+y^2=z^2</math> | |||
= Курсы = | |||
* [https://www.youtube.com/watch?v=tPYj3fFJGjk TensorFlow 2.0 Complete Course - Python Neural Networks for Beginners Tutorial] | |||
* https://medium.com/nuances-of-programming/%D1%82%D0%BE%D0%BF-10-%D0%BA%D1%83%D1%80%D1%81%D0%BE%D0%B2-%D0%BF%D0%BE-%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%BD%D0%BE%D0%BC%D1%83-%D0%B8-%D0%B3%D0%BB%D1%83%D0%B1%D0%BE%D0%BA%D0%BE%D0%BC%D1%83-%D0%BE%D0%B1%D1%83%D1%87%D0%B5%D0%BD%D0%B8%D1%8E-%D0%B2-2020-1e1d870a24b7 | |||
* https://skillbox.ru/course/profession-data-scientist/ | |||
* https://new.geekbrains.ru/machine-learning | |||
* https://praktikum.yandex.ru/data-scientist/ | |||
* https://www.coursera.org/learn/machine-learning | |||
* https://www.coursera.org/specializations/deep-learning | |||
* https://ru.coursera.org/specializations/machine-learning-data-analysis | |||
* https://www.udacity.com/course/machine-learning--ud262 | |||
* https://www.udacity.com/course/intro-to-machine-learning--ud120 | |||
= Большие данные = | |||
* Лучшие в своём деле: Артур Хачуян | Большие данные — Big Data | ЛСД #7 https://www.youtube.com/watch?v=frLydE1UCvA | |||
* «На чём корпорации вертели вашу приватность», Артур Хачуян (Tazeros Global) https://www.youtube.com/watch?v=8IJxO44kq24 | |||
= Методы = | = Методы = | ||
Строка 20: | Строка 51: | ||
* SciPy - https://www.scipy.org/ | * SciPy - https://www.scipy.org/ | ||
* Pandas - https://pandas.pydata.org/ | * Pandas - https://pandas.pydata.org/ | ||
** [https://pandas.pydata.org/docs/getting_started/comparison/comparison_with_sql.html Pandas like SQL] | |||
* Scikit-learn - https://scikit-learn.org/stable/ | * Scikit-learn - https://scikit-learn.org/stable/ | ||
* Matplotlib - https://matplotlib.org/ | * Matplotlib - https://matplotlib.org/ | ||
Строка 31: | Строка 63: | ||
* Theanets - https://github.com/lmjohns3/theanets | * Theanets - https://github.com/lmjohns3/theanets | ||
* Keras - https://keras.io/ | * Keras - https://keras.io/ | ||
* OpenCV - https://opencv.org/ | |||
= Датасеты = | |||
* http://yann.lecun.com/exdb/mnist/ | |||
* http://www.image-net.org/ | |||
* https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research | |||
* https://storage.googleapis.com/openimages/web/index.html | |||
* https://github.com/philipperemy/timit | |||
* https://www.nist.gov/programs-projects/face-recognition-grand-challenge-frgc | |||
= NLP = | |||
* https://ru.wikipedia.org/wiki/GPT-3 | |||
* https://russiannlp.github.io/rugpt-demo/ | |||
* https://copy.ai | |||
* https://github.com/sberbank-ai/ru-gpts | |||
= Железо и драйверы = | |||
* https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=deblocal | |||
* https://www.tensorflow.org/install/gpu | |||
* https://cloud.google.com/tpu/ | |||
= Темы = | = Темы = | ||
* https://en.wikipedia.org/wiki/Outline_of_artificial_intelligence#Branches_of_artificial_intelligence | |||
== Face Recognition == | |||
* https://github.com/ageitgey/face_recognition | |||
* https://cmusatyalab.github.io/openface/ | |||
== Speech Recognition == | |||
* https://pypi.org/project/SpeechRecognition/ | |||
== Image Object Recognition == | |||
* https://imageai.readthedocs.io/en/latest/ | |||
** https://stackabuse.com/object-detection-with-imageai-in-python/ | |||
** https://towardsdatascience.com/object-detection-with-10-lines-of-code-d6cb4d86f606 | |||
== Anomaly Detection == | |||
== Prediction == | |||
== StereoVision == | == StereoVision == | ||
* https://medium.com/stereopi/opencv-and-depth-map-on-stereopi-tutorial-62cb6792bbed | * https://medium.com/stereopi/opencv-and-depth-map-on-stereopi-tutorial-62cb6792bbed | ||
= Anaconda = | |||
* https://www.anaconda.com/products/individual | |||
* https://docs.anaconda.com/anaconda/install/linux/ | |||
* https://mas-dse.github.io/startup/anaconda-ubuntu-install/ | |||
= TensorFlow = | |||
<embedvideo service="youtube" dimensions="800x450">https://www.youtube.com/watch?v=vfyZf2Wj3pU&list=PLA0M1Bcd0w8ynD1umfubKq1OBYRXhXkmH</embedvideo> | |||
<embedvideo service="youtube" dimensions="800x450">https://www.youtube.com/watch?v=tPYj3fFJGjk</embedvideo> | |||
= Некоторые полезные ресурсы = | = Некоторые полезные ресурсы = | ||
* | * https://ods.ai/ | ||
* | * https://en.wikipedia.org/wiki/Outline_of_artificial_intelligence | ||
* https://en.wikipedia.org/wiki/List_of_artificial_intelligence_projects | |||
* https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research | |||
* http://wordnet.princeton.edu/ | * http://wordnet.princeton.edu/ | ||
* http://project.phil.spbu.ru/RussNet/index_ru.shtml | * http://project.phil.spbu.ru/RussNet/index_ru.shtml | ||
Строка 51: | Строка 138: | ||
* https://archive.ics.uci.edu/ml/datasets/mushroom | * https://archive.ics.uci.edu/ml/datasets/mushroom | ||
* https://archive.ics.uci.edu/ml/index.php | * https://archive.ics.uci.edu/ml/index.php | ||
* https://www.kaggle.com/ | * https://www.kaggle.com/ | ||
* https://keras.io/ | * https://keras.io/ | ||
Строка 61: | Строка 147: | ||
* https://arxiv.org/list/cs.CV/recent | * https://arxiv.org/list/cs.CV/recent | ||
* https://yandex.ru/dev/catboost/ | * https://yandex.ru/dev/catboost/ | ||
* [[Теория вероятностей]] |
Текущая версия от 23:36, 19 октября 2023
Software installation
- Anaconda - https://www.anaconda.com/products/individual
- To remove (base) from PS1 - conda config --set changeps1 false
- sklearn - https://scikit-learn.org/stable/
- CatBoost - https://catboost.ai/
- LightGBM - https://lightgbm.readthedocs.io/en/latest/
- XGBoost - https://xgboost.readthedocs.io/en/stable/
- Tensorflow - https://www.tensorflow.org/install - Tensorflow for old GPUs
Курсы
- TensorFlow 2.0 Complete Course - Python Neural Networks for Beginners Tutorial
- https://medium.com/nuances-of-programming/%D1%82%D0%BE%D0%BF-10-%D0%BA%D1%83%D1%80%D1%81%D0%BE%D0%B2-%D0%BF%D0%BE-%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%BD%D0%BE%D0%BC%D1%83-%D0%B8-%D0%B3%D0%BB%D1%83%D0%B1%D0%BE%D0%BA%D0%BE%D0%BC%D1%83-%D0%BE%D0%B1%D1%83%D1%87%D0%B5%D0%BD%D0%B8%D1%8E-%D0%B2-2020-1e1d870a24b7
- https://skillbox.ru/course/profession-data-scientist/
- https://new.geekbrains.ru/machine-learning
- https://praktikum.yandex.ru/data-scientist/
- https://www.coursera.org/learn/machine-learning
- https://www.coursera.org/specializations/deep-learning
- https://ru.coursera.org/specializations/machine-learning-data-analysis
- https://www.udacity.com/course/machine-learning--ud262
- https://www.udacity.com/course/intro-to-machine-learning--ud120
Большие данные
- Лучшие в своём деле: Артур Хачуян | Большие данные — Big Data | ЛСД #7 https://www.youtube.com/watch?v=frLydE1UCvA
- «На чём корпорации вертели вашу приватность», Артур Хачуян (Tazeros Global) https://www.youtube.com/watch?v=8IJxO44kq24
Методы
- Теорема Байеса
- Функции ошибки и регуляризация
- Расстояние Кульбака-Лейблера и перекрестная энтропия
- Градиентный спуск: основы
- Граф вычислений и дифференцирование на нем
- Перцептрон
- Глубокие нейронные сети
- Классификация
- Кластеризация
- Регрессия
- Машинное зрение
- Метод к-средних
- word2vec
Библиотеки
- NumPy - https://www.numpy.org/
- SciPy - https://www.scipy.org/
- Pandas - https://pandas.pydata.org/
- Scikit-learn - https://scikit-learn.org/stable/
- Matplotlib - https://matplotlib.org/
- Gensim - https://radimrehurek.com/gensim/
- H2O - https://github.com/h2oai/h2o-tutorials
- XGBoost - https://xgboost.readthedocs.io/en/latest/
- CatBoost - https://yandex.ru/dev/catboost/
- Theano - http://deeplearning.net/software/theano/
- TensorFlow - https://www.tensorflow.org/
- sknn - https://scikit-neuralnetwork.readthedocs.io/en/latest/
- Theanets - https://github.com/lmjohns3/theanets
- Keras - https://keras.io/
- OpenCV - https://opencv.org/
Датасеты
- http://yann.lecun.com/exdb/mnist/
- http://www.image-net.org/
- https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research
- https://storage.googleapis.com/openimages/web/index.html
- https://github.com/philipperemy/timit
- https://www.nist.gov/programs-projects/face-recognition-grand-challenge-frgc
NLP
- https://ru.wikipedia.org/wiki/GPT-3
- https://russiannlp.github.io/rugpt-demo/
- https://copy.ai
- https://github.com/sberbank-ai/ru-gpts
Железо и драйверы
- https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=deblocal
- https://www.tensorflow.org/install/gpu
- https://cloud.google.com/tpu/
Темы
- https://en.wikipedia.org/wiki/Outline_of_artificial_intelligence#Branches_of_artificial_intelligence
Face Recognition
Speech Recognition
Image Object Recognition
Anomaly Detection
Prediction
StereoVision
Anaconda
- https://www.anaconda.com/products/individual
- https://docs.anaconda.com/anaconda/install/linux/
- https://mas-dse.github.io/startup/anaconda-ubuntu-install/
TensorFlow
Некоторые полезные ресурсы
- https://ods.ai/
- https://en.wikipedia.org/wiki/Outline_of_artificial_intelligence
- https://en.wikipedia.org/wiki/List_of_artificial_intelligence_projects
- https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research
- http://wordnet.princeton.edu/
- http://project.phil.spbu.ru/RussNet/index_ru.shtml
- http://wordnet.ru/
- http://www.vision.caltech.edu/Image_Datasets/Caltech101/
- https://www.scipy.org/
- http://www.numpy.org/
- http://leenissen.dk/fann/html/files/fann_train-h.html
- https://archive.ics.uci.edu/ml/datasets/mushroom
- https://archive.ics.uci.edu/ml/index.php
- https://www.kaggle.com/
- https://keras.io/
- https://xgboost.readthedocs.io/en/latest/
- https://lasagne.readthedocs.io/en/latest/
- http://deeplearning.net/software/theano/index.html
- https://catboost.yandex/
- https://arxiv.org/
- https://arxiv.org/list/cs.CV/recent
- https://yandex.ru/dev/catboost/