Skip to content

DSB #110

Hi,

그것은금요일 and weekend and bulletin! Last volume was amazing and this one is amazing as well. For me it was interesting to learn about vokenization from Papers & Books or read about meta-loss landscapes in Analytical. But every article is full of interesting info. So go for it.

And as always, enjoy your reading.

Analytical

http://lukemetz.com/exploring-hyperparameter-meta-loss-landscapes-with-jax/ – Have you ever given a thought to exploration of hyperparameter meta-loss landscapes? No? Then now is the time to find out if there are better solutions than the gradient based method. (rcmd by reader)

https://github.com/norvig/pytudes/blob/master/ipynb/Advent-2020.ipynb – Breathtaking solutions for Advent of Code 2020 in Python. (rcmd by reader)

https://www.tradewithscience.com/practical-portfolio-optimization-in-python-1-3-markowitz/ – Our former colleague wrote a series of articles about portfolio optimization in Python. First two articles are more theoretical, the third one includes code. (rcmd by reader)

Computer Science & Science

https://colab.research.google.com/github/hasktorch/hasktorch-jupyter/blob/master/hasktorch_haskell_engine.ipynb#scrollTo=nwAnJwgwK0Js – Is Haskell and Hasktorch a future of datascience? Hopefully not, or…? (rcmd by reader)

https://hackernoon.com/you-train-it-you-run-it-tx1m35s6 – Data Science is not only about modelling, but also about data engineering. You train it, you run it! Why is that so important, can be found also here. (rcmd by reader)

https://www.unrealengine.com/en-US/blog/a-sneak-peek-at-metahuman-creator-high-fidelity-digital-humans-made-easy – Epic introduced their MetaHuman Creator where you can in less than hour create a digital human of pixel perfect quality.

Graphs and Visualizations

https://www.topbots.com/graph-neural-networks-applications-2021/ – Graphs neural networks should be noticed – the link will tell you how to apply them in practice. (rcmd by reader)

http://www.theorangeduck.com/page/machine-learning-kolmogorov-complexity-squishy-bunnies – Kolmogorov complexity and squishy bunnies explained by math and nice vizualizations.

https://www.nytimes.com/2021/02/06/science/tectonic-plates-continental-drift.html – Simulation of tectonic plates movement.

Business and Career

https://www.springboard.com/blog/data-science-talent-shortage/ – Data scientists are still in demmand and still there is a shortage of them. Why is that and how to handle it is answered in the article.

https://theintercept.com/2021/01/31/fintech-biden-nominees-robinhood/ – Robinhood showed that fintechs are powerfull and maybe, (un)fortunately, we may expect some regulations. Not only because of the stock market but also because of shadow banking and more.

https://thefinancialbrand.com/107353/fintech-big-tech-partnership-trend-bank-credit-union-digital-innovation/ – I love it, this article shows differences in thinking between big finance institutions like banks and fintechs. Put it shortly, to have mobile apps is not enough.

Pop

https://thedig.substack.com/p/eggs-basket-the-hidden-and-not-so – Why big companies like Tesla invest into Bitcoin? Why, why, why? Yes, there are reasons.

https://tedunderwood.com/2021/02/02/why-sf-hasnt-prepared-us-to-imagine-machine-learning/ – I really like this parable of ML to Library of Babel – ML enhances our world, creates more possibilities, but where does it lead?

https://dropbox.tech/machine-learning/cannes–how-ml-saves-us–1-7m-a-year-on-document-previews – Dropbox saved 2 mil. USD thanks to optimizing documents preview.

Education

https://www.python.org/dev/peps/pep-0636/ – Match statement for pattern matching is new for Python 3.10. The quickest intro is in Appendix A. (rcmd by reader)

https://pritesh-shrivastava.github.io/blog/2021/02/09/matrix-transformation – Linear transformation of 2D space using matrices, simply and intuitively with great visualizations + code in Python. (rcmd by reader)

https://old.reddit.com/r/MachineLearning/comments/lgsgz8/d_deep_learning_theory/ – This is gold. Reddit thread about deep learning theory with many interesting sources.

Data & Libraries

https://github.com/python-trio/trustme –  Generate fake TLS certs to use in your tests with Python library trustme. (rcmd by reader)

https://github.com/microsoft/qlib – qlib is a python library by Microsoft for AI quantitative investing. 

https://github.com/ebhy/budgetml – Quickly deploy your models with BudgetML – seems interesting for particular utilization when you don’t care about details.

Video & Podcast

https://www.codenewbie.org/podcast/what-s-a-container – Podcast about containers – Kubernetes, Heroku, Digital Ocean and many more. (rcmd by reader)

https://rstudio.com/resources/rstudioglobal-2021/ – rstudio::global 2021 is a conference that was held in January and lectures are already online. In one of them, for example, Michael Chow transformed Tidyverse to Python. (rcmd by reader)

https://www.youtube.com/watch?v=8Ko3TdPy0TU&feature=share – Is it possible to be so lucky, that it is virtually impossible? (rcmd by reader)

Papers & Books

https://arxiv.org/abs/2010.06775 – Learn these new terms: vokenization and vokens (visual tokens). You convert the language tokens to related images and achieve a visual supervision of the language model. Code is here. (rcmd by reader)

https://arxiv.org/abs/1803.02349 – Graph embedding of commodities for recommendation system in Alibaba. (rcmd by reader)

https://paperswithcode.com/paper/zero-offload-democratizing-billion-scale – One serious optimization of GPU computation with help from CPU. It’s almost unbelievable. With one GPU you can achieve the same speed as with multiple GPUs before. Code is here.

Behind the Fence

https://ai-jobs.net/job/5234-data-scientist/ – Data scientist in Air-Conditioning, Heating, and Refrigeration Institute, Arlington, Virginia, USA.

Joke

https://www.reddit.com/r/ProgrammerHumor/comments/li9obu/husband_does_good_maths/ – Oh my 😀 Scroll down to see others, they are hilarious.

One Comment

Leave a Reply