Thursday, February 27, 2020

[Links of the Day] 27/02/2020 : Finance Datasets, OS build using WASM and IPFS, Team Wiki

  • Quandl : financial, economic, and alternative datasets for financial professionals. A treasure trove of information, sadly not all free.
  • RedShirt : Operating system build using WASM that you can run off straight from IPFS ... Next thing you know we will have a blockchain-based package management system for an end to end trustworthy OS... Well as long as you have access to the internet.
  • Outline : wiki and knowledgebase for growing teams using the markdown language. You can run it straight of docker.

Tuesday, February 25, 2020

[Links of the Day] 25/02/2020 : Tensor flow deployment, Framework for automating machine learning pipeline

  • TensorFlow Deployment : A collection of tensorflow use case and infrastructure associated deployment patterns. Each example comes with code and often a ready to run docker image.
  • artificial neural network explained and demonstrated with code : The simplest form of an artificial neural network explained and demonstrated.
  • Aethos : an interesting project where the authors designed a library/platform to automates data science and analytical tasks at any stage in the pipeline. They offer a uniform API for automating analytical techniques from various libraries such as pandas, sci-kit learn, gensim, etc. It's still work in progress but has some potential.


Thursday, January 09, 2020

[Links of the Day] 09/01/2020 : SaaS Postmortem, MIT DeepLearning Lectures, Kafka GUI

  • A Failed SaaS Postmortem : an interesting postmortem of a failed SaaS. The TL;DR: too much focus on tech, not enough on customers.
  • MIT Deep Learning : 2019 lecture on deep learning, started this January.
  • KafkaHQ : Nice Kafka GUI for topics, topics data, consumers group, schema registry, connect and more.. If you want an alternative to it you can also check out Pulsar [pulsar] [pulsar dashboard]

Tuesday, January 07, 2020

[Links of the Day] 07/01/2020: 2019 AI index report, Essential Guide to electronics in Shenzhen, Tech Lead Expectations for engineering projects

AI index report 2019 : a little bit generic but still a good refresh of where we are and where we may be going.
Essential Guide to Electronics in Shenzhen : Huaqiangbei electronics market in Shenzhen is a must-visit for any tech aficionado. On top of the diversity of things, you can find you literally have access to a swarm of manufacturer and engineering group that can build whatever you need or can think of under one roof.
Tech Lead Expectations for Engineering Projects : an awesome overview of the tech lead by Gergely Orosz @uber. It provides a good overview and guidance of what this role consists of, and what the expectations are.






Thursday, January 02, 2020

[Links of the Day] 02/01/2020 : Video and Slides for Networking @scale and NeurIPS 2019 + AWS batch job monitoring


  • NeurIPS 2019 : All slides and video from the biggest AI / ML conference this year.
  • Networking @scale 2019 : All video from Facebook Networking at scale conference
  • batchiepatchie : A really cool project that allows you to monitor, gather metrics and display useful information about your AWS batch jobs.

Tuesday, December 31, 2019

[Links of the Day] 31/12/2019 : C++ STL iterators for GO, Hotchips 2019 and Rust implementation of Graph based Nearest Neighbour search


  • iterGo implementation of C++ STL iterators and algorithms. I really need to look into C++ again. Been a really long time since I touched it.
  • HotChips 2019 : All videos from talks at this year HotChips conference ( 31st of its kind!)
  • GranneGraph-based approximate nearest neighbour search in Rust, I really like Annoy from Spotify but this Rust implementation seems attractive enough that I will give it a spin. 




Thursday, December 19, 2019

[Links of the Day] 19/12/2019 : Machine learning at arXiv, Netflix human centered machine learning infrastructure management library, Filesystems are still not fully SSD aware

  • arXiv Machine Learning Classification Guide : how does ArXiv classify papers automatically with machine learning and what they plan to do with it in the future.
  • Metaflow : Netflix open source it's human-centred python library for managing machine learning infrastructure. It can user PyTorch, Tensorflow and Scikit. [website]
  • Evaluating File System Reliability on Solid State Drives : Your filesystem needs to understand the underlying hardware in order to guarantee reliability and security of the data stored. Nowadays the majority of storage solution relies on SSD but the authors discovered that many of the common filesystems are not fully SSD aware in their operations. They demonstrated that in 16% of the case faults resulted in irrecoverable failures.