tl;dr We feature 5 of the top machine learning code repositories on Github from Singapore. The Top 5 is made up of popular implementations of state-of-the-art Computer Vision (CV) and Natural Language Processing (NLP) models and even a high-frequency trading project. The ranking is decided based on the total stars (stargazer count) of the repositories.

5. PyTorch Implementation of EfficientDet

Architecture of EfficientDet including a weighted bi-directional feature pyramid network (BiFPN).

A PyTorch implementation of the 2019 Computer Vision paper EfficientDet: Scalable and Efficient Object Detection from Google Brain. The official implementation by Google Brain is in TensorFlow.

AuthorĐào Minh Toàn (toandaominh1997)
VocationData Scientist at VinID
LanguageStarsForksWatchersOpen Issues

4. Object Detection: YOLO3

Object bounding boxes on video footage as predicted by YOLO3. Retrieved from the official YOLO site.

A Computer Vision repository with code for training and evaluation of a YOLO3 model for the Object Detection task. YOLO, You Only Look Once, is a state-of-the-art, real-time object detection model. Its claim to fame is its extremely fast and accurate and you can trade-off speed and accuracy without re-training by changing the model size. Multi-GPU training is also implemented.

AuthorHuynh Ngoc Anh (experiencor)
VocationMachine Learning Engineer at Grab
LanguageStarsForksWatchersOpen Issues

3. Chinese Named Entity Recognition and Relation Extraction

Visualization of 3x3, 7x7 and 15x15 receptive fields produced by 1, 2 and 4 dilated convolutions by the IDCNN model.

An NLP repository including state-of-art deep learning methods for various tasks in chinese/mandarin language (中文): named entity recognition (NER/实体识别), relation extraction (RE/关系提取) and word segmentation.

LicenseNot Specified
AuthorWang Guan (crownpku)
VocationSenior Data Scientist, VP at Swiss Re
LanguageStarsForksWatchersOpen Issues

2. High-frequency Trading Model using the Interactive Brokers API

Demo of setting up the model using docker-compose. Retrieved from Github.

A high-frequency trading model using Interactive Brokers API with pairs and mean-reversion in Python. It was last updated with v3.0 in June 2019. The author describes the model as utilizing statistical arbitrage incorporating these methodologies:

  • Bootstrapping the model with historical data to derive usable strategy parameters
  • Resampling inhomogeneous time series to homogeneous time series
  • Selection of highly-correlated tradable pair
  • The ability to short one instrument and long the other.
  • Using volatility ratio to detect up or down trend.
  • Fair valuation of security using beta, or the mean over some past interval.
  • One pandas DataFrame to store historical prices
AuthorJames Ma (jamesmawm)
VocationFull-stack software engineer and author of Mastering Python for Finance.
LanguageStarsForksWatchersOpen Issues

1. DeepLab v3+ model in PyTorch

Some results of the deep labelling model on various datasets. Retrieved from Github.

A computer vision repository which started with an early PyTorch implementation (circa 2018) of DeepLab-V3-Plus (in PyTorch 0.4.1). DeepLab is a series of image semantic segmentation models whose latest version, v3+, is state-of-art on the semantic segmentation task. It can use Modified Aligned Xception and ResNet as backbone. The authors train DeepLab V3 Plus using Pascal VOC 2012, SBD and Cityscapes datasets. Pre-trained models on ResNet, MobileNet and DRN are provided.

AuthorJianfeng Zhang (jfzhang95)
VocationPhD Student at NUS
LanguageStarsForksWatchersOpen Issues

More ML repositories

Visit to view a full list of ML repositories from Singapore.