In this video I show you how to to load different file formats (json, csv, tsv) in Pytorch Torchtext using Fields, TabularDataset, BucketIterator to do all the heavy preprocessing for NLP tasks, such as numericalizing, padding, building vocabulary, which saves us a lot of time to focus on actually training the models! In this example I show a toy example dataset for sentiment analysis but the things we go through are general and can be adapted for any dataset.
Resources I used to learn about torchtext:
https://torchtext.readthedocs.io/en/l...
https://anie.me/OnTorchtext/
https://github.com/bentrevett
https://towardsdatascience.com/howto...
https://mlexplained.com/2018/02/08/a...
❤ Support the channel ❤
/ @aladdinpersson
Paid Courses I recommend for learning (affiliate links, no extra cost for you):
⭐ Machine Learning Specialization https://bit.ly/3hjTBBt
⭐ Deep Learning Specialization https://bit.ly/3YcUkoI
MLOps Specialization http://bit.ly/3wibaWy
GAN Specialization https://bit.ly/3FmnZDl
NLP Specialization http://bit.ly/3GXoQuP
✨ Free Resources that are great:
NLP: https://web.stanford.edu/class/cs224n/
CV: http://cs231n.stanford.edu/
Deployment: https://fullstackdeeplearning.com/
FastAI: https://www.fast.ai/
My Deep Learning Setup and Recording Setup:
https://www.amazon.com/shop/aladdinpe...
GitHub Repository:
https://github.com/aladdinpersson/Mac...
✅ OneTime Donations:
Paypal: https://bit.ly/3buoRYH
▶ You Can Connect with me on:
Twitter / aladdinpersson
LinkedIn / aladdinperssona95384153
Github https://github.com/aladdinpersson