Skip to content

DSB #128

Hi,

the first volume of this year comes on Sunday evening, and brings lot’s of reading. I found the article about logging from Computer Science & Science very practical and maybe more generally I would recommend focusing on web 3.0 which is discussed almost everywhere and also in our Pop category.

And as always, enjoy your reading.

Analytical

https://endcrawl.com/credits-ordering/Endcrawl prepares movie credits. And they order them with the help of graph theory. (rcmd by reader)

https://evjang.com/2021/12/17/lang-generalization.html – Generalization hase been an important topic in ML for years. How do we define it for language models?

https://towardsdatascience.com/exploring-the-nft-transaction-with-neo4j-cba80ead7e0b – Quite nice and simple exploratory graph analysis of 6 milion NFT transactions with code. (rcmd by reader)

Computer Science & Science

https://sobolevn.me/2020/03/do-not-log – To log, or not to log? And what are the alternatives when ordinary logging is unsuitable? (rcmd by reader)

https://www.morling.dev/blog/whats-in-a-good-error-message – Recoginze good error messages and learn some best practices on how to write them.

https://davidamos.dev/three-things-you-might-not-know-about-numbers-in-python/ – Did you know that numbers in Python have methods or a hieararchy? And that’s not all…

Graphs and Visualizations

https://neuripsav.vizhub.ai/blog/ – Investigate the papers published at NeurIPS in the last 35+ years. Full of inspirational ideas on visualizations.

https://observablehq.com/@tomlarkworthy/notebooks2021 – Wow, not only links, but also description of 100 beautiful notebooks of last year. Another inspirational article.

https://twitter.com/abmakulec/status/1479496579040034822 – It takes skill to create a great graph, in this twitter thread you can read and see some differences between an average graph and an amazing graph.

Business and Career      

https://medium.com/serious-scrum/scaling-is-easy-if-you-just-let-go-8665e15b02c7 – Scaling in Agile, how to handle complexity of the whole process and when you should just avoid it. “Agile is primarily about empowering those responsible for creating value... so that they can deal with the scaling processes and tools themselves.” (rcmd by reader)

https://mikkeldengsoe.substack.com/p/data-to-engineers – Can you cluster and distinguish companies by data to engineers ratio? Yes, you can and you will get an interesting story.

https://www.coindesk.com/business/2022/01/12/us-banks-form-group-to-offer-usdf-stablecoin – U.S. banks plan to offer their own stablecoin, called USDF. Question is whether it will be insured or not.

Pop

https://www.coindesk.com/layer2/2022/01/14/web-3-is-a-long-fight-worth-fighting/ – Is Web 3 a real thing? And what can we expect from this so-called revolution?

https://spectrum.ieee.org/artificial-intelligence-2021 – Top 10 articles about AI from last year. Some of them indicate sobering from uncritical hype. (rcmd by reader)

https://www.vice.com/en/article/m7v79v/notorious-mafia-fugitive-caught-chilling-on-google-street-view – After 20 years an
Italian mafia boss was arested thanks to Google Street view. (rcmd by reader)

Education

https://sirupsen.com/napkin/neural-net – Build your own neural network from scratch! I like these hands-on tutorials.

https://hackernoon.com/adversarial-machine-learning-a-beginners-guide-to-adversarial-attacks-and-defenses – Intro on
adversarial attacks and defenses. (rcmd by reader)

https://jxmo.io/posts/variational-autoencoders – Exhausting intro to variational autoencoders, mainly theoretical with lot’s of equations.

Data & Libraries

https://benn.substack.com/p/entity-layer – Change your data warehouse into centralized operational brain with entity layers and reverse ETL.

https://dapr.io/ – Dapr = The Distributed Application Runtime. It provides APIs that handle microservice connectivity. (rcmd by reader)

https://www.visidata.org/ – VisiData is an open-source tool for datasets handling in terminal. Demo video is here. (rcmd by reader)

MLOps

https://blog.devgenius.io/why-google-treats-sql-like-code-and-you-should-too-53f97925037e – Data Engineers at Google treat SQL the same way Software Engineers treat code. Which of course does not mean only obvious tools like Git, but mainly standardized approaches.

https://eugeneyan.com/writing/system-design-for-discovery/ – See how the system designs for industrial recommendations and search works. Examples from companies like Alibaba, Facebook and more. (rcmd by reader)

https://huyenchip.com/2022/01/02/real-time-machine-learning-challenges-and-solutions.html – Design of real-time ML. What can go wrong and how to handle it.

Video & Podcast

https://youtu.be/z5slE_akZmc – The 2021 AI rewind in 15 minutes.

Papers & Books

https://arxiv.org/abs/2201.00650 – Deep Learning Interviews is a book, or rather an inventory of numerous job interviews and exams.

https://janvanhaaren.be/2021/12/30/soccer-analytics-review-2021.html – 50 reaserch papers, 40 blog posts, 11 new articles, 9 webinars, 8 events and 3 podcasts just about soccer analytics.

https://medium.com/paperswithcode/papers-with-code-2021-a-year-in-review-de75d5a77b8b – The top trending papers, libraries and datasets for 2021 on Papers with Code.

Behind the Fence

https://apply.workable.com/fabulousco/j/4AE7D1737E/ – Senior Analytics Engineer at Fabulous in Paris, France.

Joke

https://i.redd.it/hofafbdq5yb81.png

Be First to Comment

Leave a Reply