Thursday, November 29, 2018

[Links of the Day] 29/11/2018 : AWS Cloudformation testing , The Dassault model, Word2Vec Anything

  • taskcat : Testing tools for AWS Cloudformation templates. Deploys your template in one or more regions and report if there is an issue. 
  • A Dassault Dossier : excellent Rand report on Dassault aviation and how the company was able to deliver the entire french aircraft combat fleet for the past two decades. It's all about development focus rather than production focus. 
  • Basilica : word2vec for anything. And the concept is amazing, seriously, you can pretty much get a similarity measure for any type of data.


Tuesday, November 27, 2018

[Links of the Day] 27/11/2018 : AWS deployment workflow framework, Dockerize your dev workflow, Nuke your AWS account

  • Odin : yet another AWS deployment solution. But this time with step function!! The concept is rather ingenious. And I wish we could literally script cloud formation deployment cycle using step function and lambda. There is probably a way using the WaitCondition and stuff. But it seems extremely convoluted. In the meantime kudos to Coinbase, I really like it and hope to adopt some of Odin mechanism myself.
  • Binci : containerize development workflow with docker. Personally, I have used docker compose to do it. I have a little bit of an issue with npm/JS as the core language for these type of solution. But all in all, containerizing your development environment and workflow should be standard practice by now in the industry. Sadly it is isn't yet. 
  • cloud-nuke : want to wipe your AWS/Azure/GCP resources associated with an account, use cloud-nuke. Alternative solution: aws-nuke Nuka-Cola sold separately...





Thursday, November 08, 2018

[Links of the Day] 08/11/2018 : large scale study of datacenter network reliability, What to measure in production, Failure Mode effect Analysis

  • A Large Scale Study of Data Center Network Reliability : the authors study reliability within and between Facebook datacenters. One of the key findings is the growth in complexity, heterogeneity and interconnectedness of datacenter increase the rate of occurrence of unwanted behaviours. Moreover, this seems to be also a key potential limiting factor for world scale spanning infrastructure undergoing rapid organic growth.
  • Understanding Production: What can you measure? : what do you need to monitor and measure in production. Very good summary of many blog post out there.
  • Failure Mode Effects Analysis (FMEA) : once you start reaching a certain production scale and more stringent requirement kicks in ( unless you were unlucky enough to have them at the get-go). You might want to run a failure modes and effects analysis (FMEA) is a step-by-step approach for identifying all possible failures in a design, a manufacturing or assembly process, or a product or service. While it was mainly designed to address shortcomings in the manufacturing industry, it is still extremely useful for IT system analysis, especially when you want to prepare yourself pre-rollout of a chaos monkey like system.


Tuesday, November 06, 2018

[Links of the Day] 06/11/2018 : Intro to probabilistic programming, Unit tests for data, Ali Wong stand-up routing analysis


  • An Introduction to Probabilistic Programming: a first-year graduate-level introduction to probabilistic programming. It not only provides a thorough background for anyone wishing to use a probabilistic programming system but also introduces the techniques needed to design and build these systems.
  • deequ : library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
  • Ali Wong structure of stand up comedy : fantastic and beautiful designed article analysing Ali wong stand up routine and how it is closer to a tv/movie script than a slapstick one-line joke comedy.


Thursday, November 01, 2018

[Links of the Day] 01/11/2018 : multi-platform, multi-architecture CPU emulator framework , Periodic Table of Data Structures, Model-Based Machine learning book

  • Unicorn : a qemu offshoot offering a lightweight multi-platform, multi-architecture CPU emulator framework. 
  • Periodic Table of Data Structures : the author try to express the different data structure used in computer science using a universal model. Once completed, formalized and validated this could be invaluable help for accelerating decision making processing for software architecture design. Now if somebody could do the same for software project management...
  • Model-Based Machine learning book : this book is trying to teach you how to think like a statistician to solve some nontrivial analytical problems. Not many new books cover this skillset, and just for this alone, this book is a must-read for anybody using machine learning.