Friday is finally here and also the DSB! For me the most interesting link in this DSB is the one about hardware lottery in Computer Science. Also for those who are into Rasa, the link in Education gives you opportunity to take a course on Udemy for free. But please, do not use it if you are not truly interested. Thanks 😉
And as always, enjoy your reading.
https://multithreaded.stitchfix.com/blog/2020/09/24/what-makes-a-good-estimator/ – How to recognize a good estimator? Learn about unbiasedness, consistency, asymptotic normality, efficiency and robustness.
https://mindee.com/documentation/apis/tutorial-parsing-receipts-with-mindees-api/python – Use Mindee’s API to parse and extract data from receipts. Developer account is for free. (rcmd by reader)
https://www.brodrigues.co/blog/2020-09-05-tidytable/ – Do you like tidyverse, but you want be as fast as data.table? Then tidytable is exactly what you need! (rcmd by reader)
Computer Science & Science
https://hardwarelottery.github.io/ – Impressive long reading about hardware lottery, a problem when your solution is the best one there is just because it suits the hardware and software you are using. It is written by Sara Hooker, a researcher at Google Brain.
https://www.analyticsvidhya.com/blog/2020/09/what-is-aws-amazon-web-services-data-science/ – A brief overview of AWS services for those who do not have any idea yet. (rcmd by reader)
https://medium.com/analytics-vidhya/why-gpus-are-more-suited-for-deep-learning-3f823d66f5e4 – We all know that GPU is better for deeplearning than CPU, but why?
Graphs and Visualizations
https://www.pipinghotdata.com/posts/2020-09-22-exporting-editable-ggplot-graphics-to-powerpoint-with-officer-and-purrr/ – ggplot is the best – convince me otherwise – now you can even export editable plot to PowerPoint. (rcmd by reader)
https://www.kdnuggets.com/2020/09/geographical-plots-python.html – Geographical plots with plotly and cufflinks in Python.
Business and Career
https://bdtechtalks.com/2020/09/24/microsoft-openai-gpt-3-license/ – Microsoft is licensing GPT-3 by OpenAI (more about it also here) and the question is what does it mean for OpenAI itself or for other researchers? (rcmd by reader)
https://hbr.org/amp/2019/01/the-hard-truth-about-innovative-cultures – Older article about innovative cultures, how to create one and which principles are essential. (rcmd by reader)
https://www.irozhlas.cz/zpravy-domov/miliardy-korun-policie-ncoz-banka-prani-spinavych-penez_2009290600_onz – Naprosto bizarní zpráva o zesnulé ženě, která do bank skrze směnárny dostala 12 mld. Kč…
https://www.stateof.ai/ – Last year in DSB #75 we shared a report about AI in 2019. Now you can read updated version of this comprehensive report for current funny year 2020. (rcmd by reader)
https://thenextweb.com/neural/2020/09/04/this-ai-predicted-who-will-play-the-next-james-bond-hint-toss-a-coin-to-him/amp/ – AI predictions who will be the next James Bond. (rcmd by reader)
https://ceskapozice.lidovky.cz/tema/meli-bychom-rozlisovat-praktickou-a-teoretickou-umelou-inteligenci.A200915_213817_pozice-tema_lube – Rozhovor s profesorem z ČVUT Vladimírem Maříkem, expertem na robotiku, který zde hovoří o AI, robotizaci a návazných tématech nejen pohledem ČR. (rcmd by reader)
https://www.udemy.com/course/rasa-certification-workshop/?couponCode=CONTRIBUTORS-83F3EF – Please, only for those whore are really interested in Rasa, you can use this free ticket to take a course on Udemy. It is valid only for two days, so don’t miss this opportunity, if you are interested! First come, first served. (rcmd by reader)
https://github.com/ForrestKnight/open-source-cs-python – Make your own computer science bachelor degree with these free courses, I bet you recognize many of them. Author of the list explains the idea here in the video. (rcmd by reader)
https://blog.tensorflow.org/2020/09/introducing-tensorflow-recommenders.html – Intro to recommenders in TensorFlow.
Data & Data Mining
https://www.r-bloggers.com/2020/08/introducing-collapse-advanced-and-fast-data-transformation-in-r/ – Fast data transformation in R with collapse, C/C++ based package. (rcmd by reader)
https://medium.com/northzone/unpacking-the-data-hype-8c3a0ae63564 – Data is the new oil, whatever, but what is truly important in case of data in your company?
https://www.microsoft.com/en-us/research/project/coax-rl/ – Just introducing coax, a modular reinforcement learning Python package.
Papers & Books
https://www.nature.com/articles/s41467-020-18190-5 – Authors review the different applications for mobile phone data in guiding and evaluating COVID-19 response.
Behind the Fence
https://careers.ap.org/job/New-York-Data-Science-Manager-NY-10281/676334500/ – Data Science Manager in New York, USA.
https://devhumor.com/content/uploads/images/September2020/sql_programmers.jpg – I press the Shift, though…
It has been a long time since we had the hacking time! This task was recommended by reader.
- just two lists:
a = [“__asd”, “sdfsd”, “__Asfda”, “__asfsdfsdf”, “dfs”, “dfd”]
b = [“O”, “X”, “O”, “X”, “O”, “O”]
- It is a “conditional reduce”, when the dictionary elements “a” and “b” are merged to words/letters.
- The merge takes place between the element i and ii, unless ii begins with “__”.
[(‘asdsdfsd’, [“O”, “X”]), (‘Asfda’, [“O”]), (‘asfsdfsdfdfsdfd’, [“X”, “O”, “O”])]