
DSB #113

Hi,

Winter is hopefully gone and Friday sliding is over, but spring is here and so is the DSB! I recommend the article about color scales in Graphs and Visualizations, or the (not so) funny Facebook prevarication about their models and their effects on society.

As always, enjoy your reading.

Analytical

https://blogboard.io/blog/data-science-in-marketing-optimization/ – Several case studies from companies like Airbnb, Netflix, Lyft and others about marketing optimization.

https://syamkakarla.medium.com/deep-learning-for-land-cover-classification-of-satellite-imagery-using-python-e7ca9f7bfa0a – 3D-CNN for land cover classification of satellite imagery using Python – includes code.

https://yeqiuu.medium.com/the-heartfelt-story-of-me-building-a-league-of-legends-win-interpreter-for-hard-stuck-silver-ii-36684c99facc – Win interpreter for League of Legends with Python code.

Computer Science & Science

https://levelup.gitconnected.com/hidden-power-of-polymorphism-in-python-c9e2539c1633 – Polymorphism in Python. (rcmd by reader)

https://github.com/tuvtran/project-based-learning#python – List of multiple projects done not only in Python. Top notch resource perfect for inspiration. (rcmd by reader)

https://blog.trailofbits.com/2021/03/15/never-a-dill-moment-exploiting-machine-learning-pickle-files/ – If you use the pickle format, maybe you should learn how it works.
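
For readers who have never looked inside pickle, here is a minimal sketch of why loading one is not just "reading data". It assumes Python 3's standard `pickle` module; the class name `Innocent` is hypothetical and not taken from the linked article, and the callable is deliberately harmless.

```python
import pickle


class Innocent:
    """Hypothetical class: looks like plain data, but decides what runs on load."""

    def __reduce__(self):
        # pickle stores "call this callable with these arguments" and
        # executes that call while loading. Here the callable is the
        # harmless built-in sorted(), but it could be any importable function.
        return (sorted, ([3, 1, 2],))


payload = pickle.dumps(Innocent())

# Loading does not rebuild an Innocent object; it runs sorted([3, 1, 2]).
result = pickle.loads(payload)
print(type(result).__name__, result)
```

The loaded value is whatever the stored call returns, which is why pickle files from untrusted sources should not be loaded.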

Graphs and Visualizations

https://blog.datawrapper.de/which-color-scale-to-use-in-data-vis/ – Which color scale to use when? And when to use a qualitative scale and when a quantitative one. (rcmd by reader)

https://themarkup.org/citizen-browser/2021/03/11/split-screen – Facebook content differs for each group of users. This visualization shows who sees what. (rcmd by reader)

https://pudding.cool/2021/03/wine-model/ – This is amazing, I really love beautiful visualizations and this one is more than that. It’s playful, interesting and contains math about wine. Couldn’t ask for more.

Business and Career

https://core.hubuc.com/banking-as-a-service-examples/ – Banking as a plug-and-play service for non-financial entities. (rcmd by reader)

https://www.nytimes.com/2021/03/17/opinion/ai-employment-bias-nyc.html – Hiring technology has the potential for sexism and racism and probably needs to be regulated. (rcmd by reader)

https://blog.ploeh.dk/2021/03/22/the-dispassionate-developer/ – This article is about developers but it also applies to data scientists. These positions are expected to learn and study in their free time continuously and perpetually. And in whose interest is that?

Pop

https://www.technologyreview.com/2021/03/11/1020600/facebook-responsible-ai-misinformation/ – Facebook itself knows that its engagement-maximizing models increase polarization. The problem is quite complex, because if you want to be fair, you have to be allowed to lose money. Of course FB is afraid of regulation, so it is proposing its own solution that would basically only protect it from competition – typical behaviour of a monopoly. (rcmd by reader)

https://www.microsoft.com/en-us/worklab/work-trend-index/hybrid-work – Hybrid work seems inevitable and Microsoft provides multiple interesting insights and visualizations on the topic. (rcmd by reader)

https://www.sfchronicle.com/business/article/GitLab-S-F-s-remote-work-pioneer-has-advice-16044687.php – But there are also opinions that hybrid working is the worst option for everybody. (rcmd by reader)

Education

https://blog.earthly.dev/compiling-containers-dockerfiles-llvm-and-buildkit/ – What does it mean to compile a container? How does it work? (rcmd by reader)

https://chris-said.io/2021/03/13/instrumental-variables – Instrumental variables for non-economists. (rcmd by reader)

https://www.kdnuggets.com/2021/03/top-youtube-machine-learning-channels.html – Lists of YouTube machine learning channels, and sentdex is in first place, hence it’s a good list.

Data & Libraries

https://venturebeat.com/2021/03/18/the-great-data-decentralization-is-coming-are-you-ready/ – What is the new trend called data decentralization? (rcmd by reader)

https://pymde.org/ – Python library PyMDE for computing vector embeddings of items. (rcmd by reader)

https://eng.uber.com/ubers-journey-toward-better-data-culture-from-first-principles/ – Principles for proper data culture by Uber.

Video & Podcast

https://www.youtube.com/watch?v=SB-qEYVdvXA – Eleven minutes of cute kittens, nothing more, nothing less… (rcmd by reader)

https://www.youtube.com/watch?v=06-AZXmwHjo – The one and only Mr. Ng generally about AI and its future. (rcmd by reader)

https://www.youtube.com/playlist?list=PL2UML_KCiC0UlY7iCQDSiGDMovaupqc83 – Applied Machine Learning course by Cornell Tech.

Papers & Books

https://arxiv.org/abs/2004.13301 – Garbage collector that autonomously learns over time when to perform collections. What could go wrong? (rcmd by reader)

https://arxiv.org/abs/2103.02559v2 – Paper about Minimum-Distortion Embedding implemented in PyMDE. (rcmd by reader)

https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-1044-7 – Bizarre article about Excel automatically converting gene symbols to dates and floating-point numbers, which messed up at least 704 published papers. (rcmd by reader)

Behind the Fence

https://www.hellofresh.com/careers/listings/2931086?country=us – Principal Data Scientist at HelloFresh in Chicago, USA.

Joke

https://www.monkeyuser.com/2021/trolley-conundrum/ – Trolley problem 😀
