Machine Learning and Big Data: differences between revisions

From Artem Aleksashkin's Wiki
[[Файл:Ai-brain.jpg|400px]]


= Hardware =


* Lenovo x230 + eGPU
** [https://aliexpress.ru/item/32983647923.html Expresscard V8.0 EXP GDC Beast PCIe PCI-E]
** 350-600 W power supply
** Nvidia GeForce 760 4gb
** [https://egpu.io/forums/builds/thinkpad-x230-express-card-2-0-5-gt-s-windows-10-by-boelly/ Similar setup]
** [https://egpu.io/forums/expresscard-mpcie-m-2-adapters/mpcieecngff-m2-resolving-detection-bootup-and-stability-problems/ Troubleshooting]
** 16 GB of RAM causes lag. Remove one memory stick to go down to 8 GB
** Make sure the GPU is fully connected to power (8+6 or 8+8 pin); otherwise it can produce error 43
** [https://www.youtube.com/watch?v=p59MNoqWY9c eGPU setup Lenovo Thinkpad x230 with GTX 760 Part 1 ( setup )]
** [https://www.youtube.com/watch?v=xJsHLTCo9Ho eGPU setup Lenovo Thinkpad x230 with GTX 760 Part 2 ( Fixing Error 12)]
** [https://www.youtube.com/watch?v=qOoY30pubBg eGPU setup Lenovo Thinkpad x230 with GTX 760 Part 3 ( Gameplay )]
** In Windows, go to Control Panel, Hardware setup, Nvidia settings, 3D graphics. There you can select the default video adapter
*** This alone won't make games use the GPU: you need to connect an external screen to the GPU and disable the laptop screen. Then games will run on the eGPU.


= Software =
* Anaconda - https://www.anaconda.com/products/individual
** To remove (base) from PS1 - conda config --set changeps1 false
* sklearn - https://scikit-learn.org/stable/
* CatBoost - https://catboost.ai/
* LightGBM - https://lightgbm.readthedocs.io/en/latest/
* XGBoost - https://xgboost.readthedocs.io/en/stable/
* Tensorflow - https://www.tensorflow.org/install - [[Tensorflow for old GPUs]]
* [https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html NVIDIA CUDA Installation Guide for Linux]
<pre>
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-ubuntu2004.pin
sudo mv cuda-ubuntu2004.pin /etc/apt/preferences.d/cuda-repository-pin-600
wget https://developer.download.nvidia.com/compute/cuda/11.2.2/local_installers/cuda-repo-ubuntu2004-11-2-local_11.2.2-460.32.03-1_amd64.deb
sudo dpkg -i cuda-repo-ubuntu2004-11-2-local_11.2.2-460.32.03-1_amd64.deb
sudo apt-key add /var/cuda-repo-ubuntu2004-11-2-local/7fa2af80.pub
sudo apt-get update
sudo apt-get -y install cuda nvidia-cuda-toolkit
</pre>
* [https://www.tensorflow.org/install/gpu TensorFlow for GPU]
* [https://developer.nvidia.com/rdp/cudnn-download cuDNN SDK]
* [https://developer.nvidia.com/nvidia-tensorrt-7x-download TensorRT]
* [https://towardsdatascience.com/installing-tensorflow-gpu-in-ubuntu-20-04-4ee3ca4cb75d Installing TensorFlow GPU in Ubuntu 20.04]
* https://developer.nvidia.com/cuda-gpus
== Change Default Python ==
<pre>
sudo update-alternatives --install /usr/bin/python python /usr/bin/python2 1
sudo update-alternatives --install /usr/bin/python python /usr/bin/python3 2
sudo update-alternatives --config python
</pre>
== Define Your Cuda Version ==
* '''Nvidia GeForce GTX 760 4gb''' -> '''Nvidia Kepler'''
* Nvidia Kepler -> CUDA SDK 10.0 – 10.2 support for compute capability 3.0 – 7.5 ('''Kepler''', Maxwell, Pascal, Volta, Turing). Last version with support for compute capability 3.x (Kepler). 10.2 is the last official release for macOS, as support will not be available for macOS in newer releases.
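
The range quoted above can be turned into a quick check. This is a minimal sketch in Python, assuming only the 3.0 to 7.5 compute capability range stated for CUDA SDK 10.0 to 10.2; the names <code>parse_cc</code> and <code>supported_by_cuda_10</code> are hypothetical helpers for illustration, not part of any NVIDIA tool.

```python
# Compute capability range quoted above for CUDA SDK 10.0 - 10.2.
CUDA_10_MIN_CC = (3, 0)
CUDA_10_MAX_CC = (7, 5)


def parse_cc(text):
    """Turn a string such as "3.0" into a comparable (major, minor) tuple."""
    major, minor = text.strip().split(".")
    return int(major), int(minor)


def supported_by_cuda_10(cc_text):
    """Return True if the compute capability falls inside the quoted range."""
    return CUDA_10_MIN_CC <= parse_cc(cc_text) <= CUDA_10_MAX_CC


if __name__ == "__main__":
    # The GTX 760 (Kepler) reports compute capability 3.0.
    print(supported_by_cuda_10("3.0"))
```

Tuples are used instead of floats so that a value such as "3.10" would still compare correctly.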


== Define Your Tensorflow Version ==
<math>x^2+y^2=z^2</math>


* Check all possible TensorFlow and Cuda versions here: https://www.tensorflow.org/install/source#gpu
* For me - '''tensorflow-2.3.0''', '''cuda 10.2''', '''nvidia-440.33.0''', '''cuDNN 7.6''', '''Bazel 3.1.0''', '''GCC 7.3.1'''
** PS: cuda 10.1 won't install because the 418 driver is not compatible with the new kernel 5.4.0-70-generic
<pre>
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-ubuntu1804.pin
sudo mv cuda-ubuntu1804.pin /etc/apt/preferences.d/cuda-repository-pin-600
wget https://developer.download.nvidia.com/compute/cuda/10.2/Prod/local_installers/cuda-repo-ubuntu1804-10-2-local-10.2.89-440.33.01_1.0-1_amd64.deb
sudo dpkg -i cuda-repo-ubuntu1804-10-2-local-10.2.89-440.33.01_1.0-1_amd64.deb
sudo apt-key add /var/cuda-repo-10-2-local-10.2.89-440.33.01/7fa2af80.pub
sudo apt-get update
sudo apt-get -y install cuda
</pre>
<pre>
>>> import tensorflow as tf
>>> tf.__version__
'2.3.0'
>>> tf.test.is_built_with_cuda()
True
</pre>
 
<pre>
$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Sun_Jul_28_19:07:16_PDT_2019
Cuda compilation tools, release 10.1, V10.1.243
</pre>
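
To compare the installed toolkit against the versions listed above, the release number can be pulled out of the <code>nvcc --version</code> text. This is a minimal sketch, assuming the output keeps the "release X.Y" wording shown above; <code>nvcc_release</code> is a hypothetical helper name.

```python
import re

# Sample text copied from the `nvcc --version` output above.
SAMPLE = """nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Sun_Jul_28_19:07:16_PDT_2019
Cuda compilation tools, release 10.1, V10.1.243
"""


def nvcc_release(output):
    """Return the toolkit release as (major, minor), or None if not found."""
    match = re.search(r"release (\d+)\.(\d+)", output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


if __name__ == "__main__":
    print(nvcc_release(SAMPLE))
```

To use live output instead of the sample, the text can be obtained with <code>subprocess.run(["nvcc", "--version"], capture_output=True, text=True).stdout</code>. In the output above the reported release is 10.1.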
 
<pre>
Python 3.8.5 (default, Jan 27 2021, 15:41:15)
[GCC 9.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import tensorflow as tf
>>> tf.config.list_physical_devices("GPU")
2021-03-29 00:19:23.520023: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcuda.so.1
2021-03-29 00:19:23.575281: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-03-29 00:19:23.575801: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1561] Found device 0 with properties:
pciBusID: 0000:04:00.0 name: GeForce GTX 760 computeCapability: 3.0
coreClock: 1.15GHz coreCount: 6 deviceMemorySize: 3.94GiB deviceMemoryBandwidth: 179.05GiB/s
2021-03-29 00:19:23.576789: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudart.so.10.1
2021-03-29 00:19:23.581937: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcublas.so.10
2021-03-29 00:19:23.583502: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcufft.so.10
2021-03-29 00:19:23.585776: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcurand.so.10
2021-03-29 00:19:23.591336: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusolver.so.10
2021-03-29 00:19:23.593271: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcusparse.so.10
2021-03-29 00:19:23.701034: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcudnn.so.7
2021-03-29 00:19:23.701468: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-03-29 00:19:23.702637: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-03-29 00:19:23.703486: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1657] Ignoring visible gpu device (device: 0, name: GeForce GTX 760, pci bus id: 0000:04:00.0, compute capability: 3.0) with Cuda compute capability 3.0. The minimum required Cuda capability is 3.5.
[]
</pre>
* https://medium.com/@mccann.matt/compiling-tensorflow-with-cuda-3-0-support-42d8fe0bf3b5
<pre>
git clone https://github.com/tensorflow/tensorflow.git
cd ./tensorflow
git checkout r2.2

sudo apt install apt-transport-https curl gnupg
curl -fsSL https://bazel.build/bazel-release.pub.gpg | gpg --dearmor > bazel.gpg
sudo mv bazel.gpg /etc/apt/trusted.gpg.d/
echo "deb [arch=amd64] https://storage.googleapis.com/bazel-apt stable jdk1.8" | sudo tee /etc/apt/sources.list.d/bazel.list
sudo apt update && sudo apt install bazel-2.0.0
</pre>

= Courses =
* [https://www.youtube.com/watch?v=tPYj3fFJGjk TensorFlow 2.0 Complete Course - Python Neural Networks for Beginners Tutorial]
* https://medium.com/nuances-of-programming/%D1%82%D0%BE%D0%BF-10-%D0%BA%D1%83%D1%80%D1%81%D0%BE%D0%B2-%D0%BF%D0%BE-%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%BD%D0%BE%D0%BC%D1%83-%D0%B8-%D0%B3%D0%BB%D1%83%D0%B1%D0%BE%D0%BA%D0%BE%D0%BC%D1%83-%D0%BE%D0%B1%D1%83%D1%87%D0%B5%D0%BD%D0%B8%D1%8E-%D0%B2-2020-1e1d870a24b7
* https://skillbox.ru/course/profession-data-scientist/
* SciPy - https://www.scipy.org/
* Pandas - https://pandas.pydata.org/
** [https://pandas.pydata.org/docs/getting_started/comparison/comparison_with_sql.html Pandas like SQL]
* Scikit-learn - https://scikit-learn.org/stable/
* Matplotlib - https://matplotlib.org/
* https://github.com/philipperemy/timit
* https://www.nist.gov/programs-projects/face-recognition-grand-challenge-frgc
= NLP =
* https://ru.wikipedia.org/wiki/GPT-3
* https://russiannlp.github.io/rugpt-demo/
* https://copy.ai
* https://github.com/sberbank-ai/ru-gpts


= Hardware and drivers =


* https://medium.com/stereopi/opencv-and-depth-map-on-stereopi-tutorial-62cb6792bbed
= Anaconda =
* https://www.anaconda.com/products/individual
* https://docs.anaconda.com/anaconda/install/linux/
* https://mas-dse.github.io/startup/anaconda-ubuntu-install/
= TensorFlow =
<embedvideo service="youtube" dimensions="800x450">https://www.youtube.com/watch?v=vfyZf2Wj3pU&list=PLA0M1Bcd0w8ynD1umfubKq1OBYRXhXkmH</embedvideo>
<embedvideo service="youtube" dimensions="800x450">https://www.youtube.com/watch?v=tPYj3fFJGjk</embedvideo>


= Some useful resources =
* https://arxiv.org/list/cs.CV/recent
* https://yandex.ru/dev/catboost/
* [[Теория вероятностей|Probability theory]]
= Ready-made neural networks =
== Video editing ==
* Adobe Premiere Pro
* Final Cut Pro
* iMovie
* Filmora
* HitFilm Express
* DaVinci Resolve
* Camtasia
* Lightworks
* Shotcut
* OpenShot
* VSDC Free Video Editor
* Blender
* Avid Media Composer
* Movavi Video Editor
* VideoPad
* Magisto
* Animoto
* Lumen5
* InVideo
* Clipchamp
== Content creation ==
* AnswerThePublic
* Ahrefs
* SEMrush
* Moz
* Google Trends
* Ubersuggest
* BuzzSumo
* Social Animal
* ContentStudio
* Brand24
* Mention
* Feedly
* Quora
* Reddit
* Trendspottr
* BuzzStream
* Sprout Social
* Hootsuite Insights
* Followerwonk
* Nuzzel
== Automation ==
* Buffer.com
* Hootsuite.com
* Later.com
* Sprout Social
* CoSchedule
* Zoho Social
* Loomly
* Sendible
* SocialBee
* Crowdfire
* Agorapulse
* Tailwind
* MeetEdgar
* SocialPilot
* Planoly
* Post Planner
* Iconosquare
* Pallyy
* RecurPost
* IFTTT
== Copywriting ==
* Grammarly
* Hemingway Editor
* Copy.ai
* Writesonic
* Jarvis (now Jasper)
* ProWritingAid
* QuillBot
* CoSchedule
* Portent's Content Generator
* Slick Write
* TextExpander
* CopySmith
* INK Editor
* Outranking
* Conversion.ai
* Copyscape
* Clearscope
* Frase
* SurferSEO
* ContentBot
== Graphic design ==
* Affinity Designer
* Adobe Spark
* Crello
* Piktochart
* Stencil
* Snappa
* Visme
* Desygner
* PicMonkey
* Easil
* Fotor
* Venngage
* BeFunky
* RelayThat
* DesignBold
* FotoJet
* Gravit Designer
* Pixlr
* Canva
* Inkscape
== Marketing ==
* Hootsuite
* Buffer
* Sprout Social
* HubSpot
* Marketo
* SEMrush
* Brandwatch
* Falcon.io
* Mailchimp
* Ahrefs
* Canva
* AdEspresso
* BuzzSumo
* CoSchedule
* SocialBee
* Sendible
* Later
* Loomly
* Agorapulse
* ContentStudio
== Images ==
* Shutterstock
* Getty Images
* Adobe Stock
* iStock
* Pexels
* Unsplash
* Pixabay
* Freepik
* Videvo
* Pond5
* Dreamstime
* Depositphotos
* Canva (stock library)
* Envato Elements
* Alamy
* Motion Array
* Videezy
* Clipstill
* Stocksy
* Dissolve

Current revision as of 03:01, 21 September 2024

Software installation

Courses

Big data

Methods

  • Bayes' theorem
  • Loss functions and regularization
  • Kullback-Leibler divergence and cross-entropy
  • Gradient descent: basics
  • Computation graph and differentiation on it
  • Perceptron
  • Deep neural networks
  • Classification
  • Clustering
  • Regression
  • Machine vision
  • k-means
  • word2vec

Libraries

Datasets

NLP

Hardware and drivers

Topics

Face Recognition

Speech Recognition

Image Object Recognition

Anomaly Detection

Prediction

StereoVision

Anaconda

TensorFlow

Some useful resources

Ready-made neural networks
