Document Image Classification with Document Image Transformer (DiT)

Document Image Classification with Document Image Transformer (DiT)

tl;dr A step-by-step tutorial to automatically classify documents based on images of their contents. For example, automatically identify scientific papers or handwritten notes. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough....

December 19, 2022 · 5 min · Eugene
Video Subtitling with OpenAI Whisper

Video Subtitling with OpenAI Whisper

tl;dr A step-by-step tutorial to automatically generate subtitles from a video using audio segmentation and OpenAI Whisper. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

December 16, 2022 · 8 min · Eugene
Face Super Resolution with Real ESRGAN

Face Super Resolution with Real ESRGAN

tl;dr A step-by-step tutorial to upscale images with faces in the foreground by Real ESRGAN. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

December 2, 2022 · 4 min · Eugene
Video Subtitling with Wav2Vec2

Video Subtitling with Wav2Vec2

tl;dr A step-by-step tutorial to automatically generate subtitles from a video using audio segmentation and Wav2Vec2. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

November 19, 2021 · 8 min · Eugene
Art Generation with v-diffusion

Art Generation with v Diffusion

tl;dr A step-by-step tutorial to generate art pieces from a text prompt. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here. Art Generation with v Diffusion Notebook to generate artwork from an input prompt using a pre-trained model....

November 16, 2021 · 3 min · Eugene
AnimeGANv2

Face to Anime with AnimeGANv2

tl;dr A step-by-step tutorial to use AnimeGANv2 models to automatically convert faces in photos to anime style faces. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

November 9, 2021 · 4 min · Eugene
AniCharaGAN - Anime Character Generation with StyleGAN2

AniCharaGAN: Anime Character Generation with StyleGAN2

tl;dr A step-by-step tutorial to automatically generate anime characters (full-body) using a StyleGAN2 model. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

November 8, 2021 · 3 min · Eugene
Mandarin Text to Speech with Coqui TTS.

Mandarin Text to Speech with Coqui TTS

tl;dr A step-by-step tutorial to generate spoken mandarin audio from text (语音合成) using the Coqui TTS library. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

November 3, 2021 · 3 min · Eugene
Optical Character Recognition from Images with TrOCR

OCR (Optical Character Recognition) from Images with Transformers

tl;dr A step-by-step tutorial to automatically recognise text (OCR) from images of handwritten and printed text using transformer encoder-decoder models. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough. Continue on if you prefer reading the code here....

October 21, 2021 · 4 min · Eugene
Singlish Text to Speech with Malaya Speech

Singlish Text to Speech with Malaya Speech

tl;dr A step-by-step tutorial to generate spoken singlish audio from text automatically using a pipeline of a Malaya Speech model and applying speech enhancement. Practical Machine Learning - Learn Step-by-Step to Train a Model A great way to learn is by going step-by-step through the process of training and evaluating the model. Hit the Open in Colab button below to launch a Jupyter Notebook in the cloud with a step-by-step walkthrough....

October 21, 2021 · 4 min · Eugene