Skip to content

DSB #118

Hi,

and Yabba Dabba Doo! It’s Friday and you can start your weekend with DSB! I definitely recommend the video from Behind the Fence by Netflix, or the article how data science is losing its attractiveness from Business and Career.

As always, enjoy your reading.

Analytical

https://eugeneyan.com/writing/patterns-for-personalization/ – How does personalization works and which groups of methods we can distinguish.

https://towardsdatascience.com/automated-machine-learning-using-pycaret-4bb90ab3e2c7 – AutoML with PyCaret.

https://codeascraft.com/2021/06/02/increasing-experimentation-accuracy-and-speed-by-using-control-variates/ – What
is CUPED and how it can help with more informed product decisions and also fasten the experiments?

Computer Science & Science

https://www.sciencedaily.com/releases/2021/06/210616113818.htm – What kind of errors can a quantum computer have and what happens or means if they are correlated.

https://greg-kennedy.medium.com/attack-of-the-robot-authors-7fb51d4efff6 – Examples of what nowadays are computers able to write. See yourself whether they’re good or not.

https://www.freecodecamp.org/news/system-design-interview-practice-tutorial/ – System desing interview – how to handle it and how to understand a system.

Graphs and Visualizations

https://github.com/mathisonian/awesome-visualization-research – List of data visualizations research papers, books, blog posts, and others. (rcmd by reader)

https://www.analyticsvidhya.com/blog/2021/06/build-user-interface-with-gradio-for-your-deep-learning-project – Bulding
UI for your ML model, function, or API with Gradio.

https://www.analyticsvidhya.com/blog/2021/06/uber-and-lyft-cab-prices-data-analysis-and-visualization/ – Let’s play with
Uber and Lyft data and get an insight with some simple visualizations.

Business and Career

https://medium.com/swlh/why-so-many-data-scientists-quit-good-jobs-at-great-companies-429ea61fb566 – Data science is loosing its appeal, the reality setup by organisation is not meeting with expectations. (rcmd by reader)

https://krebsonsecurity.com/2021/06/how-does-one-get-hired-by-a-top-cybercrime-gang – This is how you get a job for a malware-as-a-service platform.

https://jessitron.com/2021/06/12/the-enterprise-eats-software – Corporate vs good software: “It is possible to build a good software team that can build good software, inside an enterprise. It takes shielding to protect them from the constraints that the company imposes.” (rcmd by reader)

Pop

https://www.sciencemag.org/news/2021/06/are-advertisers-coming-your-dreams – Research about dreams is quite interesting and also disturbingly attractive for companies like Amazon. Imagine your intelligent speaker trying to influence you while you sleep. (rcmd by reader)

https://thegradient.pub/how-has-ai-contributed-to-dealing-with-the-covid-19-pandemic – How did ML helped or not helped with the fight against COVID-19? A really long article but very interesting reading.

https://www.engadget.com/apple-considered-launching-its-own-primary-healthcare-service – Apple is considering its own primary healthcare service and it’s all about the data.

Education

https://huggingface.co/course/chapter1 – Courses by Huggin Face on NLP. (rcmd by reader)

https://www.analyticsvidhya.com/blog/2021/06/support-vector-machine-better-understanding – Quite comprehensive article about SVM.

https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge – Design an agent which can navigate the procedurally generated ascii dungeons in NetHack in this NeurlPS Challenge! Or challenge about Minecraft diamond mining. (rcmd by reader)

Data & Libraries

https://spin.atomicobject.com/2021/02/04/redis-postgresql/ – Redis is a popular Key-Value NoSQL database, yet the PostgreSQL can still be more than enough. (rcmd by reader)

https://cnr.sh/essays/what-the-heck-data-meshData Mesh, a new paradigm in enterprise data architecture.

https://www.allthingsdistributed.com/2021/06/amazon-timestream-time-series-is-the-new-black.html – New AWS service for data streaming (e.g. video) called Timestream. (rcmd by reader)

MLOps

https://twimlai.com/solutions/introducing-twiml-ml-ai-solutions-guide/ – There are so many ML platforms that one can easily lose his/her head. Use this to compare them and find what suits you. (rcmd by reader)

https://gradientflow.com/machine-learning-model-monitoring/ – ML models monitoring, its features and why it’s important.

https://www.analyticsvidhya.com/blog/2021/06/deploy-machine-learning-models-leveraging-cherrypy-and-docker – Deploying a model with CherryPy and Docker. With code, of course.

Video & Podcast

https://www.youtube.com/c/PyConUS – Videos from PyCon US 2021. (rcmd by reader)

https://www.youtube.com/watch?v=LlKAna21fLE – Linear algebra for ML.

Papers & Books

https://paperswithcode.com/paper/probabilistic-gradient-boosting-machines-for – Probabilistic Gradient Boosting Machines (PGBM) beats traditional GBM.

https://paperswithcode.com/paper/programming-puzzles – Python Programming Puzzles (P3), enhance your Python powers!

https://arxiv.org/abs/2005.04305 – Let’s see how algorithmic efficiency of NN is evolving. It doubles every 16 months.

Behind the Fence

https://twitter.com/WeAreNetflix/status/1405609397913481216 – Data Engineer in Netflix. Hilarious video, this is how you do your job advertisement. (rcmd by reader)

Joke

https://external-preview.redd.it/QstcdGExdmeiz6vBraIyAZ-Z56eXIiQF48SvVOu-MAM.jpg?auto=webp&s=2837c1e4fcbc2e4857d8335c66ad65709fffefd8

Be First to Comment

Leave a Reply