Thursday, February 12, 2015

Links of the day 12 - 02 - 2015

Today's links 12/02/2015: #bigdata algorithm, probabilistic programming, Start-up resources and RAM myth
  • PPAML : the Probabilistic Programming for Advancing Machine Learning program aims to make machine learning applications easier to build by using probabilistic programming.
  • Start-up Resources : a very good list of start-up resources and articles. 
  • The Myth of RAM : Memory access patterns matter... a lot. This is hardly news, but this article highlights that if you rely solely on "classical" complexity analysis to evaluate your code, you are going to be bitten badly. The main idea is that, in practice, a memory access costs O(√N) rather than O(1), where N is the amount of memory being accessed.
  • HyperLogLog : a technique to estimate the cardinality of a set of up to Nmax distinct elements using small registers of just loglog(Nmax) + O(1) bits each. Like the Linear Counter, the HyperLogLog counter lets the designer specify the desired accuracy tolerance. Very good summary of the paper here.
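
To make the HyperLogLog idea a bit more concrete, here is a minimal Python sketch. It is not taken from the paper or the linked summary: the class name `HyperLogLog`, the precision parameter `p`, and the use of SHA-1 as the hash function are all assumptions made for illustration. It keeps m = 2^p registers, each storing the largest "rank" (position of the first 1-bit) seen among the hashes routed to it, and combines them with a harmonic mean. The expected relative error is roughly 1.04/√m, so picking `p` is how you pick the accuracy tolerance.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative HyperLogLog counter (hypothetical helper, not a library API)."""

    def __init__(self, p=10):
        # p index bits give m = 2**p registers; the alpha formula below
        # is the standard one for m >= 128, so keep p >= 7.
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item):
        # Assumption: the first 64 bits of SHA-1 stand in for a uniform hash.
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)                  # top p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)     # remaining 64 - p bits
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def count(self):
        # Raw estimate: normalized harmonic mean of 2**register.
        z = sum(2.0 ** -r for r in self.registers)
        estimate = self.alpha * self.m * self.m / z
        # Small-range correction: fall back to linear counting
        # while many registers are still empty.
        zeros = self.registers.count(0)
        if estimate <= 2.5 * self.m and zeros:
            estimate = self.m * math.log(self.m / zeros)
        return estimate


hll = HyperLogLog(p=12)
for i in range(100000):
    hll.add(f"user-{i}")
print(round(hll.count()))  # an estimate close to 100000, not an exact count
```

Adding the same element again never changes a register, which is why duplicates do not inflate the estimate. With `p=12` the whole counter is 4096 small registers, no matter how many elements are added.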