Showing posts with label NLP. Show all posts
Showing posts with label NLP. Show all posts

Tuesday, March 10, 2020

[Links of the Day] 10/03/2020 : NLP models platform for elasticsearch, Encrypted Tensor flow framework, Reformer transformer machine learning model

  • nboost : scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)
  • tf-encrypted : encrypted tensor flow. This allows you to work on an encrypted dataset for generating models. It's privacy (??) preserving machine learning framework [github]
  • reformer : while most transformers are limited to a short number of tokens (512.. maybe more). Google folks came up with a new architecture called Reformer that leverage locality preserving hashing that blast past this limation and a allow handling context windows of up to 1 million words, all on a single accelerator and using only 16GB of memory.[arxiv]


Tuesday, December 10, 2019

[Links of the Day] 10/12/2019 : Company culture syllabus, Visual Guide to BERT, Consensus algorithm




Thursday, November 21, 2019

[Links of the Day] 21/11/2019 : NLP reference book, Persistent Memory test, HPC programming language

  • Natural Language Processing : an excellent book covering NLP traditional statistical methods, linear models & context-free parsing right through to modern neural network-based approaches.
  • PMTest : presistent memory test framework. I feel strongly that this is sorely needed as non-volatile tech is finally reaching the mass scale adoption in the industry. And I fear that some devs might wake up one day with a bad surprise when they realise that their PM library is not as crash consistent as advertised.
  • Chapel : Cray HPC programming language. I really wonder how hard would it be to port it / use it on traditional cloud infrastructure.

Thursday, December 06, 2018

[Links of the Day] 06/12/2018 : NLP summarisation, RPC protobuf framework, API security best practices

  • Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting : I would have mentioned this paper on the sole basis that the authors provide a GitHub repo with all the code used. This should be mandatory for any publication in Computer science. Anyway, the summarization tech described in the paper is pretty cool too. [github]
  • twirp : RPC framework with protobuf service definitions. If you don't want to go all gRPC, give this framework a serious look. I would consider twirp over gRPC for the sole reason that it uses the standard GO http server over the custom google one. Seriously google, why did you have to re-implement the HTTP server from your own language ?? 
  • API Security Best Practices : documentations providing some good security practice when it comes GitHubhub usage. As well as an excellent leak management document describing a fairly efficient process.


Tuesday, June 19, 2018

[Links of the Day] 19/06/2018 : Facebook network balancer, Open policy agent, Intel NLP libs

  • OPA : an open source policy agent that decouple policy from actual code logic. This is essential to provide great flexibility with fine-grained control of resources. These kinds of features are a key building block for secure and robust API based solution. [github]
  • Katran : facebook scalable network load balancer. It relies on eBPF and XDP from the Linux kernel to deliver impressive performance at low-cost thanks to its capability to run on off the shelf hardware. [github]
  • NLP Architect : Intel NLP library and solution. Sometimes I feel that Intel has some great hardware and software but the release cycle is rather decoupled. Which often leave the user in an odd situation, where the hardware is out but the software is not there yet. And sometimes it's the opposite. I really feel that Intel should work on this. Maybe externalise the software to a separate entity as the hardware culture might be impeding the software side of the company.


Thursday, June 14, 2018

[Links of the Day] 14/06/2018 : GDPR documentation template, Survey of Vector representation of meanings, Supervised learning by quantum neural networks


  • A Survey on Vector Representations of Meaning : the papers present an overview of the current state of word vector model research space. The survey is quite useful when you need to choose a vector model for your NLP application as each model comes with different tradeoffs.
  • EverLaw GDPR documentation Template: Highly practical and down to earth document helping you classify your current status regarding GDPR and understand what exposure you have to it. To some extent, this is almost a must fill the first step for any company out there that deals with individuals information. 
  • Supervised learning by Quantum Neural Networks:  what's better than neural networks? Quantum neural networks !!! 




Thursday, May 17, 2018

[Links of the Day] 17/05/2018 : Edge Computing and the Red Wedding problem, Vector Embedding utility , Scalability efficiency


  • Towards a Solution to the Red Wedding Problem : interesting look at how to handle massive Read spike while being able to update (write spike ) the content at the same time. The authors propose to leverage edge computing to spread and limit the impact of a write-heavy spike in such network
  • Magnitude : this is a really cool project for those out there dabbling with NLP and vector embedding. This package delivers a fast, efficient universal vector embedding utility.
  • Scalability! But at what COST? : the authors of this paper introduce the concept of measuring the scalability performance of a solution by comparing it to the hardware configuration required before the platform outperforms a competent single-threaded implementation. As always, and often, most system and company do not need a monstrous cluster to satisfy their need. But it's always more glamorous to say: "we used a cluster" rather than: "I upgraded the RAM so the model can fit in memory".




Thursday, November 02, 2017

[Links of the Day] 02/11/2017 : Probabilistic programming Library, Better than Word2Vec , Serverless conf

  • Edward : Turing complete deep probabilistic programming software library. Probabilistic computing is going to be at the forefront of the next big AI improvement wave. [software]
  • Better than Word2Vec : first word2vec is great and there is already so many pre-built libraries out there that it should be your number 1 go to approach. Then if you want to develop custom word embedding library, as the blog post explains, SVD might be a better approach. 
  • ServerlessConf NYC 2017 : a good summary of New York serverless conference  2017. 







Tuesday, March 22, 2016

[Links of the day] 22/03/2016: VPN chart, Hyperloglog in real life, NLP

  • VPN comparison chart : if you need VPN with certain characteristics, this chart if for you. 
  • HyperLogLog in practice : google looked at hyper log log (one of my favorite probabilistic datatructure) and optimized it slightly, they came up with a improved version. Great paper with good insight on how to use HLL in real world and make it work even better.
  • Deep or Shallow, NLP is Breaking Out : natural langage processing is getting boost from deep and shallow learning. We already saw the result with Skype real time translation, Siri and Cortana. Its just a matter of time before we start to see more and more connected devices with NLP capabilities. 


Tuesday, January 27, 2015

Links of the day 27 - 01 - 2015

Today's links 27/01/2015: #NLP library, Queuing Theory, #NVM in #HPC
Mem0r1es

Wednesday, October 22, 2014

Links of the day 22 - 10 - 2014

Today's links 22/10/2014: Twitter #NLP, Neural Turing Machine , Mesos, FB architecture

  • Tweet NLP : twitter natural language processing. It will help you understand and translate your teenage daughter tweet ( by Carnegie Mellon University)
  • Neural Turing Machines : combination of neural network approach with  Turing Machine or Von Neumann architecture by Google 
  • Mesos Deep Dive : architecture deep dive into Apache Mesosphere
  • FB architecture : Facebook and the KISS approach to datacenter

Monday, August 25, 2014

Links of the day 25 - 08 - 2014


Today's link : NLP, Kubernetes, CAP , text editor