Skip to content

DSB #102

Hi,

end of the week brings bulletin – at least today! This volume is much more about data engineering than data science, you can start with roadmap for a modern data engineer in Business and Career. Personally, I will definitely have a look at course about mathematical modelling of soccer.

As always, enjoy your reading.

Analytical

https://info.crunchydata.com/blog/recommendation_engine_in_postgres_with_pandas_and_python – Python is everywhere, now even in Postgres and you can build a recommendation engine this way. (rcmd by reader)

https://www.jeremyjordan.me/testing-ml/ – How to test a ML. Text by a core maintainer for PyTorch Lightning. He was inspired by this amazing paper. (rcmd by reader)

https://github.com/huggingface/pytorch_block_sparse – Use this extension for PyTorch and replace your dense matrices by the sparse ones.

Computer Science & Science

https://towardsdatascience.com/writing-more-idiomatic-and-pythonic-code-c22e900eaf83?gi=3647b4d2e701 – True pythonic code looks like this, at least according to author of the text. (rcmd by reader)

http://trevorjim.com/how-emacs-beat-vi-in-the-editor-wars/ – Funny text about two powerfull editors: vi and Emacs. (rcmd by reader)

https://mgaitan.github.io/logging-talk/logging_talk.slides.html#/ – Presentation about logging in python, how to. Use also up and down arrows to navigate through the presentation. (rcmd by reader)

Graphs and Visualizations

https://venturebeat.com/2020/09/03/ai-researchers-use-heartbeat-detection-to-identify-deepfake-videos/ – How to identify a deepfake video? Yep, there are the human blood movements.

https://www.analyticsvidhya.com/blog/2020/09/18-open-source-computer-vision-projects-beginners/ – If you wanna play with computer vision, read this one about open source computer vision projects.

https://www.nytimes.com/interactive/2020/09/02/upshot/america-political-spectrum.html – Colours of America like you have never seen before. And how are they connected to political beliefs.

Business and Career     

https://github.com/datastacktv/data-engineer-roadmap – Roadmap for a modern data engineer, what one should know? (rcmd by reader)

https://www.finews.com/news/english-news/42854-rothschild-study-revolut-n26-monzo-nubank-corona-client-data – The short summary of a study about digital banks like N26, Revolut or Monzo. What can possibly stop them?

https://thefinancialbrand.com/102021/wealthfront-robo-investing-millennial-fintech-banking-transformed-podcast-rachleff/ – Learn about Wealthfront, a fintech company that is suposed to help you invest, of course with help of technology.

Pop

https://thenextweb.com/neural/2020/09/03/tinyml-is-breathing-life-into-billions-of-devices/ – TinyML enables implementation of machine learning (ML) in ultra-low power systems. Will it be new hype? (rcmd by reader)

https://deepmind.com/blog/article/traffic-prediction-with-advanced-graph-neural-networks – Very high level view on traffic predictions by DeepMind, but still interesting reading.

https://www.theguardian.com/commentisfree/2020/sep/08/robot-wrote-this-article-gpt-3 – Article written by a robot thanks to GPT-3. And there you can read why it’s a load of bollocks.

Education

https://sites.google.com/g.harvard.edu/tinyml/home – And again TinyML and course about it by Harvard.

https://uppsala.instructure.com/courses/28112 – Wow, course about mathematical modelling of football (soccer). Seems interesting not only for fans.

https://www.analyticsvidhya.com/blog/2020/09/how-dbscan-clustering-works/ – Embrace the unsupervised DBSCAN Clustering Algorithm.

Data & Data Mining

https://app.ignitionrobotics.org/GoogleResearch/fuel/collections/Google%20Scanned%20Objects – If you need a 3D representation of common household objects, this is a database for you! Created by Google Research.

https://github.com/EthicalML/awesome-production-machine-learning/ – Amazing repository that contains everything you need to manage your ML production.

https://k8syaml.com/ – Generate your own YAML.       

Video & Podcast

https://podcasts.google.com/feed/aHR0cHM6Ly9hbmNob3IuZm0vcy8zMGQ3ZDlmMC9wb2RjYXN0L3Jzcw== – A new podcast by Jacqueline Nolis a Emily Robinson. Authors of this book about career in data science. (rcmd by reader)

https://www.wandb.com/podcast/anthony-goldbloom – Anthony Goldbloom is the founder and CEO of Kaggle and he tells you not only how Kaggle has changed but much more.

https://www.youtube.com/watch?v=9mS1fIYj1So – Tunning peformance of PyTorch just by changing a few lines of code.

Papers & Books

https://github.com/gerdm/prml – Learn about many algorithms in a book completely created in jupyter. (rcmd by reader)

http://www.stat.columbia.edu/~gelman/research/unpublished/forecast_incentives3.pdf – Very soon more than relevant paper about election predictions, how to handle and understand the models.

Behind the Fence

https://careers.jetblue.com/job/Long-Island-City-Data-Scientist-NY-11101/674588100/ – Data Scientist in jetBlue, New York, USA.

Joke

https://i.redd.it/zzxvhomgaam51.jpg

One Comment

Leave a Reply