<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:cc="http://cyber.law.harvard.edu/rss/creativeCommonsRssModule.html">
    <channel>
        <title><![CDATA[Sicara&#39;s blog - Medium]]></title>
        <description><![CDATA[We build tailor-made AI and Big Data solutions for amazing clients - Medium]]></description>
        <link>https://medium.com/sicara?source=rss----fd4c083fbb93---4</link>
        <image>
            <url>https://cdn-images-1.medium.com/proxy/1*TGH72Nnw24QL3iV9IOm4VA.png</url>
            <title>Sicara&amp;#39;s blog - Medium</title>
            <link>https://medium.com/sicara?source=rss----fd4c083fbb93---4</link>
        </image>
        <generator>Medium</generator>
        <lastBuildDate>Sun, 24 May 2026 02:29:33 GMT</lastBuildDate>
        <atom:link href="https://medium.com/feed/sicara" rel="self" type="application/rss+xml"/>
        <webMaster><![CDATA[yourfriends@medium.com]]></webMaster>
        <atom:link href="http://medium.superfeedr.com" rel="hub"/>
        <item>
            <title><![CDATA[Best of AI: 10 Articles To Read in February 2020]]></title>
            <link>https://medium.com/sicara/best-of-ai-10-articles-to-read-in-february-2020-a8bfa9810b6?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/a8bfa9810b6</guid>
            <dc:creator><![CDATA[Antoine Toubhans]]></dc:creator>
            <pubDate>Tue, 18 Feb 2020 09:53:16 GMT</pubDate>
            <atom:updated>2020-02-18T09:53:16.531Z</atom:updated>
            <content:encoded><![CDATA[<p>Welcome to the February edition of our best and favorite articles in AI that were published this month. We are a Paris-based company that does Agile data development.</p><p>This month, we spotted among others, articles about AI that can diagnose breast cancer with higher accuracy than experts! Let’s start, as usual, with the comic of the month:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/578/0*q9-pdU2WDdn7x74n" /><figcaption>Global AI</figcaption></figure><h3>1 — Breast Cancer Diagnosis</h3><p><em>Interpret screen mammography</em></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/660/0*nqQxMqtOSutZ3eLT" /><figcaption><a href="https://www.bbc.com/news/health-50857759">AI was as accurate as two doctors working together</a></figcaption></figure><p>A recent <a href="https://www.nature.com/articles/s41586-019-1799-6">evaluation</a> of a AI system for breast cancer screening concludes that it is capable of surpassing human experts in breast cancer prediction.</p><p>It is essential to identify breast cancer at earlier stages of the disease when treatment can be more successful. Screening mammography is designed to perform such identification but is complex to analyze and lead to false diagnosis: some healthy patients are diagnosed sick (false positive) and some sick patients are diagnosed healthy (false negative).</p><p>This evaluation demonstrated an absolute reduction of 5.7% and 1.2% (USA and UK) in false positives and 9.4% and 2.7% in false negatives and thus surpassing human experts in breast cancer prediction!</p><h3>2 — Here is Meena, the Universal Chatbot</h3><p>On 27th January, a Google brain team introduced <a href="https://deepai.org/publication/towards-a-human-like-open-domain-chatbot">Meena</a>, a new open-domain human-like chatbot, meaning Meena talks about any topic and it mimics the human ability to converse freely in natural language.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*4hKAtmoWgickhhZ-" /><figcaption>Meena executes a joke :)</figcaption></figure><p>Unlike other state-of-the-art open-domain chatbots (<a href="https://arxiv.org/abs/1709.02349">MILABOT</a>, <a href="https://arxiv.org/abs/1812.08989">XiaoIce</a>, <a href="https://pdfs.semanticscholar.org/b402/b85ad45e3ac51f1da8ee718373082ce24f47.pdf">Gunrock</a>, <a href="https://www.pandorabots.com/mitsuku/">Mitsuku</a>, and <a href="https://www.cleverbot.com/">Cleverbot</a>), Meena is an end-to-end Neural Network approach and do not rely on complex frameworks.</p><p>Traditionally, chatbot performance is measured through <a href="https://en.wikipedia.org/wiki/Perplexity">perplexity</a> which measures how accurately the bot anticipates what people will say next. Interestingly, there is no proof that this measure correlates with the chatbot responses being “human-like”. To alleviate this issue, the authors proposed a new evaluation metric called Sensibleness and Specificity Average (SSA) relying on humans judging how chatbot responses make sense and are specific. Two things came up :</p><ul><li>the best Meena version scores 79% SSA, it outperforms state of the art open-domain chatbots and gets close to human performance (86% SSA)</li><li>SSA and perplexity are strongly correlated: the more Meena responses are specific and accurate, the more it is able to predict people’s next answers. This is reassuring :)</li></ul><p>With these learnings, the authors hope they can get even closer to human capabilities by reducing Meena’s perplexity hence increasing SSA performance.</p><p>Meena network has 2.6B parameters and was trained over 40B words over 30 days using 2048 TPU cores, impressive!</p><p><a href="https://deepai.org/publication/towards-a-human-like-open-domain-chatbot">https://deepai.org/publication/towards-a-human-like-open-domain-chatbot</a></p><p><a href="https://arxiv.org/pdf/2001.09977v1.pdf">https://arxiv.org/pdf/2001.09977v1.pdf</a></p><h3>3 — Creative AI: The Storytelling of AI Dungeon</h3><p><a href="https://www.aidungeon.io/">AI Dungeon 2</a> is an AI-generated text adventure game. Unlike the original AI Dungeon that used an AI text generator to build scenes and choices for the player, the recently released AI Dungeon 2 is different in one major way: instead of the set commands and human-written storylines that traditionally limit player freedom, players of AI Dungeon 2 can type whatever they want. The game responds to the player’s text input thanks to a novel adaptation of <a href="https://openai.com/blog/better-language-models/">GPT-2</a> :</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*I07WXsUlrZMpGCf4" /><figcaption>Snapshots of AI Dungeon 2 mobile version</figcaption></figure><p>In <a href="https://lionbridge.ai/articles/creative-ai-the-storytelling-of-ai-dungeon/">this blog post</a>, the author tells the story of his adventure as “Henry the Wizard”. In a narrative way, he shares his learnings with us, starting from being skeptical at first and enthusiastic in the end.</p><p><a href="https://lionbridge.ai/articles/creative-ai-the-storytelling-of-ai-dungeon/">https://lionbridge.ai/articles/creative-ai-the-storytelling-of-ai-dungeon/</a></p><h3>4 — HiPlot: High-dimensional Interactive Plots Made Easy by Facebook</h3><p>On January 2020, Daniel Haziza, Jérémy Rapin, and Gabriel Synnaeve from Facebook released HiPlot, an interactive tool that allows exploring high dimensional data.</p><p>Imagine you collected data from multiple trainings: epoch, dropout, embedding size, learning rate and so on. Hiplot let you explore these dimensions in a simple way, using <a href="https://en.wikipedia.org/wiki/Parallel_coordinates">parallel plots</a>:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*AAWSzYQ4uAKVBt-A" /><figcaption>HiPlot visualization: parallel plots show dimensions along the “x” axis.</figcaption></figure><p>HiPlot is interactive, you can select the data you want to drill down by clicking on it. And it is really simple to install/use, pip install it and give it a try!</p><p><a href="https://ai.facebook.com/blog/hiplot-high-dimensional-interactive-plots-made-easy/">https://ai.facebook.com/blog/hiplot-high-dimensional-interactive-plots-made-easy/</a></p><h3>5 — The Arrival of a Train at La Ciotat Station (1895) in Full-HD</h3><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2F3RYNThid23g%3Fstart%3D1%26feature%3Doembed%26start%3D1&amp;display_name=YouTube&amp;url=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3D3RYNThid23g&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2F3RYNThid23g%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" width="854" height="480" frameborder="0" scrolling="no"><a href="https://medium.com/media/d705dccb119ed51b774d1b401c217276/href">https://medium.com/media/d705dccb119ed51b774d1b401c217276/href</a></iframe><p><a href="https://www.imdb.com/title/tt0000012/">“The Arrival of a Train at La Ciotat Station”</a>, one of the first movies ever, was produced by <a href="https://en.wikipedia.org/wiki/Auguste_and_Louis_Lumi%C3%A8re">Lumière’s brothers</a> in 1895. The story goes that when the film was first shown, the audience was so overwhelmed by the moving image of a train coming directly at them that people screamed and ran away! It is probably a cinema myth, though :)</p><p>As you can imagine, the movie has aged a little bit. Hopefully, <a href="https://www.youtube.com/channel/UCD8J_xbbBuGobmw_N5ga3MA">Denis Shiryaev</a> ran a couple of neural-network-based algorithms to improve the situation:</p><ul><li>it upscales the input video up to 4K definition, using <a href="https://topazlabs.com/gigapixel-ai/ref/291/">the GigaPixel AI tool from TopazLab</a></li><li>it increases the FPS using <a href="https://sites.google.com/view/wenbobao/dain">Depth-Aware Video Frame Interpolation (Dain)</a></li></ul><p><a href="https://www.youtube.com/channel/UCD8J_xbbBuGobmw_N5ga3MA">Denis Shiryaev</a> says anyone could have done this and the credit should go to the authors of the algorithm that make them public on GitHub. However, it is quite funny to see how fast it got viral on the web, and how it went far beyond the data-scientist community.</p><p>Following the publication of the video, <a href="https://www.youtube.com/channel/UCW1frrIYupE8TRo5GhfthTg">DeOldify</a> released <a href="https://youtu.be/EqbOhqXHL7E">a colorized version of this video</a>.</p><h3>6 — Using ‘Radioactive Data’ to Detect if a Data Set was Used for Training</h3><p>Another blogpost from Facebook.ai. The authors have developed a new technique to mark the images in a dataset so that researchers can determine whether a particular machine learning model has been trained using those images.</p><p>This is helpful to researchers and engineers to keep track of which data was used to train a model so they can better understand how it affects the performance of different neural networks.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*7BCFTNCMCnXipwf7" /><figcaption>Radioactive data used to train a CNN</figcaption></figure><p>The term “radioactive” data refer to the use of radioactive markers in medicine that are given to patients before radiography, so as to see a particular organ without harming the patient. Similarly, the “radioactive” marks in the data are harmless meaning they have no impact on the classification accuracy of models but are detectable with high confidence in a neural network.</p><p>Blog post: <a href="https://ai.facebook.com/blog/using-radioactive-data-to-detect-if-a-data-set-was-used-for-training/">https://ai.facebook.com/blog/using-radioactive-data-to-detect-if-a-data-set-was-used-for-training/</a></p><p>Paper: <a href="https://arxiv.org/pdf/2002.00937.pdf">https://arxiv.org/pdf/2002.00937.pdf</a></p><h3>7 — Is Modern Facial Recognition Biased?</h3><p>This <a href="https://lionbridge.ai/articles/is-modern-facial-recognition-biased/">article</a> presents a review of studies about existing solutions for facial recognition. It turns out that many of them have biases such as Asian and African-American faces are falsely identified 10 to 100 times more than Caucasian faces.</p><p>In a nutshell, these studies warn against the use of facial recognition systems to make decisions impacting human lives and advocate them to be banned from public places e.g., college campuses, calling for more regulation in 2020.</p><p>Read more here: <a href="https://lionbridge.ai/articles/is-modern-facial-recognition-biased/">https://lionbridge.ai/articles/is-modern-facial-recognition-biased/</a></p><h3>8 — Pandas 1.0.0 Released</h3><p>First pandas major release in a decade! Pandas is a well-known cornerstone library to whoever need to manipulate data in python. It all started in 2011, and its popularity has been skyrocketing ever since.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*sS84POJHv4tYKH4x" /></figure><p>No worries, it is not a huge / breaking release says the pandas core team. It is rather a symbolic milestone celebrating the growth of the pandas community.</p><p>For this occasion, the core team <a href="https://dev.pandas.io/pandas-blog/pandas-10.html">published this post</a> to share its thoughts about the past decade and the next one.</p><h3>9 — Machine Learning Co2 Impact</h3><p>Do you ever wonder about the impact on the environment when you train your algorithms? <a href="https://mlco2.github.io/impact/#compute">This online tool</a> lets you compute the Co2 emitted by your training based upon the GPU type and the Cloud provider.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*G1U8mY50HQZUzly0" /><figcaption>Co2 emitted for training 12 hours RTX2080 on AWS</figcaption></figure><p>It also gives you advice such as changing the region of computing to reduce your emission. A good way to empower data scientists :)</p><p><a href="https://mlco2.github.io/impact/#compute">https://mlco2.github.io/impact/#compute</a></p><h3>10 — Bayesian Product Ranking at Wayfair</h3><p><a href="https://www.wayfair.com/">Wayfair</a> is an online store for housing furniture proposing more than 14M products to their clients. In <a href="https://tech.wayfair.com/data-science/2020/01/bayesian-product-ranking-at-wayfair/?utm_campaign=Data_Elixir&amp;utm_source=Data_Elixir_269">this blog post</a>, data scientists at Wayfair share their Bayesian approach to the problem of showing more appealing products to their customers.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*afzAu8qeBYZy65CR" /><figcaption>Which shower curtains are more appealing for a new customer?</figcaption></figure><p>They used the <a href="https://pystan.readthedocs.io/en/latest/">pystan</a> package in Python to implement their solution. They also present other issues they encountered such as the model updating over time (as customer habits change all the time) and an interesting exploitation/exploration tradeoff: on one hand exploit the knowledge they already have to recommend appealing products, on the other hand, try new configurations to gather more data.</p><p><a href="https://tech.wayfair.com/data-science/2020/01/bayesian-product-ranking-at-wayfair/?utm_campaign=Data_Elixir&amp;utm_source=Data_Elixir_269">https://tech.wayfair.com/data-science/2020/01/bayesian-product-ranking-at-wayfair/?utm_campaign=Data_Elixir&amp;utm_source=Data_Elixir_269</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=a8bfa9810b6" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/best-of-ai-10-articles-to-read-in-february-2020-a8bfa9810b6">Best of AI: 10 Articles To Read in February 2020</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[TensorFlow 2.0 Tutorial : Optimizing Training Time Performance]]></title>
            <link>https://medium.com/sicara/tensorflow-2-0-tutorial-optimizing-training-time-performance-ba9418a8c288?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/ba9418a8c288</guid>
            <category><![CDATA[tensorflow]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[gpu]]></category>
            <category><![CDATA[machine-learning]]></category>
            <dc:creator><![CDATA[Raphaël Meudec]]></dc:creator>
            <pubDate>Thu, 30 Jan 2020 10:14:14 GMT</pubDate>
            <atom:updated>2020-01-30T10:14:14.075Z</atom:updated>
            <content:encoded><![CDATA[<h3>TensorFlow 2.0 Tutorial : Optimizing Training Time Performance</h3><p>This tutorial explores <strong>how you can improve training time performance of your TensorFlow 2.0</strong> model around:</p><ul><li><strong>tf.data</strong></li><li><strong>Mixed Precision Training</strong></li><li><strong>Multi-GPU Training Strategy</strong></li></ul><p>I adapted all these tricks to a custom project on image deblurring, and the result is astonishing. You can get a <strong>2–10x training time</strong> speed-up depending on your current pipeline.</p><h3>Usecase: Improving TensorFlow training time of an image deblurring CNN</h3><p>2 years ago, I published a blog post on <a href="https://www.sicara.ai/blog/2018-03-20-GAN-with-Keras-application-to-image-deblurring">Image Deblurring with GANs in Keras</a>. I thought it would be a nice transition to pass the repository in TF2.0 to understand what has changed and what are the implications on my code. In this article, I’ll train a simpler version of the model (the cnn part only).</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*EIyCfNWBYVyaUr39.png" /></figure><p>The model is a convolutional net which takes the (256, 256, 3) blurred patch and predicts the (256, 256, 3) corresponding sharp patch. It is based on the ResNet architecture and is fully convolutional.</p><h3>Step 1: Identify bottlenecks</h3><p>To optimize training speed, <strong>you want your GPUs to be running at 100% speed</strong>. nvidia-smiis nice to make sure your process is running on the GPU, but when it comes to GPU monitoring, there are smarter tools out there. Hence, the first step of this TensorFlow tutorial is to explore these better options.</p><h4>nvtop</h4><p>If you’re using an Nvidia card, the simplest solution to monitor GPU utilization over time might probably be nvtop . Visualization is friendlier than nvidia-smi , and you can track metrics over time.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*Z7PY19RdkbKa6VQ9.png" /><figcaption>nvtop screenshot</figcaption></figure><h4>TensorBoard Profiler</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*raatTRTkqJRmfEWs.png" /><figcaption>TensorBoard Profile</figcaption></figure><p>By simply setting profile_batch={BATCH_INDEX_TO_MONITOR} inside the TensorBoard callback, TF adds a full report on operations performed by either the CPU or GPU for the given batch. This can help identify if your GPU is stalled at some point for lack of data.</p><h4><a href="https://github.com/rapidsai/jupyterlab-nvdashboard">RAPIDS NVDashboard</a></h4><p>This is a <strong>Jupyterlab extension which gives access to various metrics</strong>. Along with your <strong>GPU</strong>, you can also monitor elements from your motherboard (<strong>CPU</strong>, <strong>Disks</strong>, ..). The advantage is you don’t have to monitor a specific batch, but rather have a look on performance over the whole training.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*L53DX4c7SrZ_yMtL.gif" /></figure><p>Here, we can easily spot that GPU is at 40% speed most of the time. I have activated only 1 of the 2 GPUs on the computer, so total utilization is around 20%.</p><h3>Step 2: Optimize your tf.data pipeline</h3><p>The first objective is to make the GPU busy 100% of the time. To do so, we want to <strong>reduce the data loading bottleneck</strong>. If you are using a Python generator or a Keras Sequence, your data loading is probably sub-optimal. Even if you’re using tf.data, data loading can still be an issue. In my article, I initially used Keras Sequences to load the images.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/600/0*vmcnYRfPKpSs656H.png" /></figure><p>You can easily spot this phenomenom using the TensorBoard profiling. GPUs will tend to have free time while CPUs are performing multiple operations related to data loading.</p><p>Making the switch from the original Keras sequences to tf.data was fairly easy. Most operations for data loading are pretty well-supported, the <strong>only tricky part is to take the same patch on the blurred image and the real one.</strong></p><p>.…<a href="https://www.sicara.ai/blog/tensorflow-tutorial-training-time"> read the full article on sicara.ai</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=ba9418a8c288" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/tensorflow-2-0-tutorial-optimizing-training-time-performance-ba9418a8c288">TensorFlow 2.0 Tutorial : Optimizing Training Time Performance</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Meetup Computer Vision Paris]]></title>
            <link>https://medium.com/sicara/meetup-computer-vision-paris-a7b210d19faf?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/a7b210d19faf</guid>
            <category><![CDATA[meetup]]></category>
            <category><![CDATA[meetup-paris]]></category>
            <category><![CDATA[meetup-computer-vision]]></category>
            <category><![CDATA[computer-vision]]></category>
            <dc:creator><![CDATA[Jean-Régis de VAUPLANE]]></dc:creator>
            <pubDate>Wed, 29 Jan 2020 17:08:28 GMT</pubDate>
            <atom:updated>2020-01-29T17:08:28.563Z</atom:updated>
            <content:encoded><![CDATA[<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*zrti_038dCiPm7ofaPy7TQ.png" /><figcaption>Meetup du 08 janvier 2020 au BCG</figcaption></figure><p>Le MeetUp Computer Vision est né fin 2017 pour regrouper la communauté Computer Vision de Paris.</p><p>Le MeetUp CV est là pour permettre d’aborder des sujets de résolutions de problèmes techniques, de présenter l’état de l’art sur la recherche mais aussi d’approfondir les use case que permet la Vision.</p><p>Tous les 2 mois, la <a href="https://www.meetup.com/fr-FR/Meetup-Computer-Vision-Paris/">communauté Computer Vision</a>, forte de 1700 membres début 2020, se réunit à Paris le temps d’un meetup. Le temps d’une soirée, des data scientists viennent présenter leurs apprentissages sur un sujet d’étude ou un cas client. Après les interventions, la communauté échange autour d’un cocktail. Voici quelques exemples de talks qui ont eu lieu :</p><h4>Meetup Computer Vision #8 — Few shot learning for classification in production par Clément Walter, lead data scientist @Sicara.</h4><p>Les capacités de classification des réseaux de neurones profonds ont démultipliés les cas d’application de la reconnaissance d’image. Cependant ceux-ci nécessitent d’important volumes de données pour être ajustés (trained) au problème considéré.<br>Le few-shot learning s’intéresse justement aux cas où ces volumes de données ne sont pas disponibles : comment ajuster un réseau lorsqu’on n’a qu’un seul représentant par classe ?<br>Cette présentation commencera par présenter le paradigme du few-shot learning et insistera sur la complexité de l’utilisation concrète d’algorithmes type Siamese en production. La présentation sera illustrée par du code open source. Vous pouvez retrouver les slides de présentation <a href="https://docs.google.com/presentation/d/13sVnlxdgCnwjmSaRgbtUvRgYZ7dQDEyB47H2syQA8do/edit">ici</a>.</p><h4>Meetup Computer Vision #9 — The photo lifecycle, from the photons to the eye, by Juliette Chataigner, Data Scientist @Meero.</h4><p>Le talk était composé de 3 parties :<br>- Acquisition: Comparaison du post-traitement d’un appareil photo reflex numérique et d’un smartphone.<br>- Editing: Transformer une image brute en une belle.<br>- Conclusion sur un dernier problème : l’affichage (différents affichages, calibrage, profils…) et comment ils affectent l’image.</p><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2FCjGmJpNnM1E%3Fstart%3D355%26feature%3Doembed%26start%3D355&amp;display_name=YouTube&amp;url=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DCjGmJpNnM1E&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2FCjGmJpNnM1E%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" width="854" height="480" frameborder="0" scrolling="no"><a href="https://medium.com/media/9d20b2f8f29d9a6e671ac2dd50db4ed8/href">https://medium.com/media/9d20b2f8f29d9a6e671ac2dd50db4ed8/href</a></iframe><h4>Meetup Computer Vision #10 — Pierre Marcenac, lead data scientist @Kili, “How to scale training data?”</h4><p>“Il vaut mieux avoir un algorithme correct sur beaucoup de données, qu’un algorithme excellent sur peu de données”<br>Ainsi, la labellisation de données, même si parfois pénible, une étape critique dans un projet de machine learning. Lorsqu’on veut annoter des données en très grande quantité, il faut et une interface fluide, mais aussi du machine learning (pour faire de la pré-annotation par exemple). L’enjeu réside alors à réaliser le travail d’annotation sans dégrader la qualité des labels. Nous vous montrons comment Kili utilise le machine learning pour annoter de très grandes quantités de données sans dégrader la qualité du process.</p><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2FGPQrzZjc3aM%3Fstart%3D773%26feature%3Doembed%26start%3D773&amp;display_name=YouTube&amp;url=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DGPQrzZjc3aM&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2FGPQrzZjc3aM%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" width="854" height="480" frameborder="0" scrolling="no"><a href="https://medium.com/media/8a6e4b9b80386d7c8405da0211109628/href">https://medium.com/media/8a6e4b9b80386d7c8405da0211109628/href</a></iframe><h4>Autres talks computer vision présentés lors des MeetUps de 2018–2019 :</h4><ul><li>La mise en place d’un moteur de recommandation pour le e-commerce basé sur la similarité visuelle des produits pour augmenter le taux de conversion (Olivier Chancé, @Sicara)</li><li>Le développement par Ubble d’une solution d’identification en ligne simple et fiable, en mettant un effort particulier sur la création d’algorithmes de détection de fraude. Disposant d’un niveau de données inégal en fonction des problématiques traitées, l’équipe a mis au point une solution unique mêlant computer vision classique et deep learning.</li><li>L’entraînement d’un modèle qui reconnaît les références de sac à main dans des vidéos de défilés de mode et sur Instagram afin de prédire les ventes (Matthieu Montaigu et Kasra Mansouri, Data Scientists @Artefact)</li><li>La détection en temps réel des attaques de guichets automatiques bancaires par un système embarqué (Grégoire Martinon, DS @Quantmetry).</li><li>L’acquisition et la labellisation d’un dataset par crowdsourcing ainsi que la détection dans un contexte à forte densité d’objets, (Augustin Rudigoz et Bruno Peyrou, @Mobeye App).</li></ul><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=a7b210d19faf" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/meetup-computer-vision-paris-a7b210d19faf">Meetup Computer Vision Paris</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Hands on hyperparameter tuning with Keras Tuner]]></title>
            <link>https://medium.com/sicara/hands-on-hyperparameter-tuning-with-keras-tuner-aeef4a58509?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/aeef4a58509</guid>
            <category><![CDATA[computer-vision]]></category>
            <category><![CDATA[hyperparameter-tuning]]></category>
            <category><![CDATA[tensorflow2]]></category>
            <category><![CDATA[keras]]></category>
            <category><![CDATA[machine-learning]]></category>
            <dc:creator><![CDATA[Juliep]]></dc:creator>
            <pubDate>Wed, 22 Jan 2020 13:34:50 GMT</pubDate>
            <atom:updated>2020-01-30T10:30:41.986Z</atom:updated>
            <content:encoded><![CDATA[<p>This post will explain how to perform <strong>automatic hyperparameter tuning</strong> with <strong>Keras Tuner and Tensorflow 2.0</strong> to <strong>boost accuracy</strong> on a computer vision problem.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*tC8mhvqIvlZhNR3M.jpg" /></figure><p>Here you are : your model is running and producing a first set of results. However they fall far from the top results you were expecting. You’re missing one crucial step : hyperparameter tuning!</p><p>In this post, we’ll go through a whole hyperparameter tuning pipeline <strong>step by step</strong>. Full code is available on <a href="https://github.com/JulieProst/keras-tuner-tutorial">Github</a>.</p><h3>What is hyperparameter tuning and why you should care</h3><p>A machine learning model has two types of parameters:</p><ul><li><strong>trainable parameters</strong>, which are learned by the algorithm during training. For instance, the weights of a neural network are trainable parameters.</li><li><strong>hyperparameters</strong>, which need to be set before launching the learning process. The learning rate or the number of units in a dense layer are hyperparameters.</li></ul><p>Hyperparameters can be numerous even for small models. Tuning them can be a real brain teaser but worth the challenge: <strong>a good hyperparameter combination can highly improve your model’s performance</strong>. Here we’ll see that on a simple CNN model, it can help you <strong>gain 10% accuracy </strong>on the test set!</p><p>Thankfully, open-source libraries are available to automatically perform this step for you!</p><h3>Tensorflow 2.0 and Keras Tuner</h3><p><a href="https://www.tensorflow.org/">Tensorflow</a> is a vastly used, open-source, machine learning library. In September 2019, <a href="https://blog.tensorflow.org/2019/09/tensorflow-20-is-now-available.html">Tensorflow 2.0 was released</a> with major improvements, notably in user-friendliness. With this new version, <a href="https://keras.io/">Keras</a>, a higher-level Python deep learning API, became Tensorflow’s main API.</p><p>Shortly after, the Keras team released <a href="https://keras-team.github.io/keras-tuner/">Keras Tuner</a>, <strong>a library to easily perform hyperparameter tuning with Tensorflow 2.0</strong>. This post will show <strong>how to use it</strong> with an <strong>application to object classification</strong>. It will also include a comparison of the different hyperparameter tuning methods available in the library.</p><h3>Hyperparameter tuning with Keras Tuner</h3><p>Before diving into the code, a bit of theory about Keras Tuner. <em>How does it work?</em></p><figure><img alt="hyperparameter-tuning-process-keras-tuner" src="https://cdn-images-1.medium.com/max/1024/0*EoF1z3GpqouqcGHi.png" /><figcaption>Hyperparameter tuning process with Keras Tuner</figcaption></figure><p>First, a <strong>tuner</strong> is defined. Its role is to <strong>determine which hyperparameter combinations should be tested</strong>. The library search function performs the iteration loop, which evaluates a certain number of hyperparameter combinations. Evaluation is performed by computing the trained model’s accuracy on a held-out validation set.</p><p>Finally, the best hyperparameter combination in terms of validation accuracy can be tested on a held-out test set.</p><h3>Getting started</h3><p>Let’s get started! With this tutorial, you’ll have an <strong>end-to-end pipeline to tune a simple convolutional network’s hyperparameters for object classification</strong> on the CIFAR10 dataset.</p><h4>Installation step</h4><p>First, install Keras Tuner from your terminal:</p><pre>pip install keras-tuner</pre><p>You can now open your favorite IDE/text editor and start a Python script for the rest of the tutorial!</p><h4>Dataset</h4><figure><img alt="computer-vision-cifar10-dataset" src="https://cdn-images-1.medium.com/max/480/0*CeEVF7iDqGDGP6t4.png" /><figcaption>CIFAR10 random samples. The dataset is composed of 60000 images belonging to one out of 10 object classes.</figcaption></figure><p>This tutorial uses the CIFAR10 dataset. CIFAR10 is a <strong>common benchmarking dataset in computer vision</strong>. It contains 10 classes and is relatively small, with 60000 images. This size allows for a relatively short training time which we’ll take advantage of to perform multiple hyperparameter tuning iterations.</p><p>Load and pre-process data:</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/eeddba22af5cac8364e9f84f372d649b/href">https://medium.com/media/eeddba22af5cac8364e9f84f372d649b/href</a></iframe><p>The tuner expects floats as inputs, and the division by 255 is a data normalization step.</p><h4>Model definition</h4><p>Here, we’ll experiment with a simple <strong>convolutional model</strong> to classify each image into one of the 10 available classes.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*3VfqSj-3fjze1iLb.jpeg" /><figcaption>Simple CNN representation, from this great <a href="https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53">blog post about CNNs</a></figcaption></figure><p>Each input image will go through two convolutional blocks (2 convolution layers followed by a pooling layer) and a dropout layer for regularization purposes. Finally, each output is flattened and goes through a dense layer that classify the image into one of the 10 classes.</p><p>In Keras, this model can be defined as below :</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/c9f46bcd2d2b5f2d128533d9d53cc8d6/href">https://medium.com/media/c9f46bcd2d2b5f2d128533d9d53cc8d6/href</a></iframe><h4>Search Space definition</h4><p>To perform hyperparameter tuning, we need to define the search space, that is to say <strong>which hyperparameters need to be optimized and in what range</strong>. Here, for this relatively small model, there are already 6 hyperparameters that can be tuned:</p><ul><li>the dropout rate for the three dropout layers</li><li>the number of filters for the convolutional layers</li><li>the number of units for the dense layer</li><li>its activation function</li></ul><p>In Keras Tuner, hyperparameters have a type (possibilities are Float, Int, Boolean, and Choice) and a unique name. Then, a set of options to help guide the search need to be set:</p><ul><li>a minimal, a maximal and a default value for the Float and the Int types</li><li>a set of possible values for the Choice type</li><li>optionally, a sampling method within linear, log or reversed log. Setting this parameter allows to add prior knowledge you might have about the tuned parameter. We’ll see in the next section how it can be used to tune the learning rate for instance</li><li>optionally, a step value, i.e the minimal step between two hyperparameter values</li></ul><p>For instance, to set the hyperparameter ‘number of filters’ you can use:</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/310671af515ccbad1a82e2ad1e425722/href">https://medium.com/media/310671af515ccbad1a82e2ad1e425722/href</a></iframe><p>The dense layer has two hyperparameters, the number of units and the activation function:</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/c082a4bd253e6e1afc7e80b1917996b4/href">https://medium.com/media/c082a4bd253e6e1afc7e80b1917996b4/href</a></iframe><h4>Model Compilation</h4><p>Then let’s move to model compilation, where other hyperparameters are also present…</p><p>…<a href="https://www.sicara.ai/blog/hyperparameter-tuning-keras-tuner">read the full article here</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=aeef4a58509" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/hands-on-hyperparameter-tuning-with-keras-tuner-aeef4a58509">Hands on hyperparameter tuning with Keras Tuner</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Optimize Response Time of your Machine Learning API in Production]]></title>
            <link>https://medium.com/sicara/optimize-response-time-of-your-machine-learning-api-in-production-1c32944a6a04?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/1c32944a6a04</guid>
            <category><![CDATA[response-time]]></category>
            <category><![CDATA[api]]></category>
            <category><![CDATA[machine-learning]]></category>
            <dc:creator><![CDATA[Yannick Wolff]]></dc:creator>
            <pubDate>Mon, 13 Jan 2020 16:34:32 GMT</pubDate>
            <atom:updated>2020-01-30T10:37:07.813Z</atom:updated>
            <content:encoded><![CDATA[<p>This article demonstrates how building a <strong>smarter</strong> <strong>API</strong> serving <strong>Deep Learning models</strong> <strong>minimizes </strong>the<strong> response time</strong>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*G1z6Bsqkc5wfvyx2.jpeg" /></figure><p>Your team worked hard to build a <strong>Deep Learning model</strong> for a given task <em>(let’s say: detecting bought products in a store thanks to Computer Vision).</em> Good.</p><p>You then developed and deployed an <strong>API that integrates this model</strong> <em>(let’s keep our example: self-checkout machines would call this API)</em>. Great!</p><p>The new product is working well and you feel like all the work is done.</p><p>But since the manager decided to install more self-checkout machines <em>(I really like this example)</em>, users have started to complain about the huge latency that occurs each time they are scanning a product.</p><p>What can you do? Buy 10x faster — and 10x more expensive — GPUs? Ask data scientists to try reducing the depth of the model without degrading its accuracy?</p><p>Cheaper and easier solutions exist, as you will see in this article.</p><h3>A basic API with a big dummy model</h3><p>First of all, we’ll need a model with a <strong>long inference time</strong> to work with. Here is how I would do that with <a href="https://www.tensorflow.org/"><strong>TensorFlow </strong></a>2’s <strong><em>Keras</em></strong> API (if you’re not familiar with this Deep Learning framework, just step over this piece of code):</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/261c4a469e19f1ba59e3888100fd47be/href">https://medium.com/media/261c4a469e19f1ba59e3888100fd47be/href</a></iframe><p>When testing the model on my <a href="https://www.nvidia.com/fr-fr/geforce/graphics-cards/rtx-2080/">GeForce RTX 2080</a> GPU, I measured an <strong>inference time</strong> of <strong>303 ms</strong>.<strong> </strong>That’s what we can call a <em>big</em> model.</p><p>Now, we need a <strong>very simple API </strong>to serve our model, with only one route to ask for a prediction. A very standard API framework in Python is <a href="https://www.palletsprojects.com/p/flask/"><strong>Flask</strong></a>. That’s the one I chose, along with a <a href="https://www.fullstackpython.com/wsgi-servers.html">WSGI HTTP Server</a> called <a href="https://gunicorn.org/"><strong>Gunicorn</strong></a>. Our unique route parses the input from the request, calls the instantiated model on it and sends the output back to the user.</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/475b0c41781133f60eaef2669b464ad4/href">https://medium.com/media/475b0c41781133f60eaef2669b464ad4/href</a></iframe><p>We can run our deep learning API with the command:</p><pre>gunicorn wsgi:app</pre><p>Okay, I can now send some random numbers to my API and it responds to me with some other random numbers. The question is: how fast?</p><h3>Let’s load test our API</h3><p>Read the full article on <a href="https://www.sicara.ai/blog/optimize-response-time-api">Sicara’s blog</a>.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=1c32944a6a04" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/optimize-response-time-of-your-machine-learning-api-in-production-1c32944a6a04">Optimize Response Time of your Machine Learning API in Production</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[The Best of AI Articles Published in December 2019]]></title>
            <link>https://medium.com/sicara/the-best-of-ai-articles-published-in-december-2019-49092afcf41?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/49092afcf41</guid>
            <category><![CDATA[data-science]]></category>
            <category><![CDATA[ai]]></category>
            <category><![CDATA[articles]]></category>
            <category><![CDATA[best-of]]></category>
            <dc:creator><![CDATA[Jean-Baptiste Jézéquel]]></dc:creator>
            <pubDate>Thu, 09 Jan 2020 10:18:35 GMT</pubDate>
            <atom:updated>2020-01-09T10:39:34.796Z</atom:updated>
            <content:encoded><![CDATA[<h4>Deeper Fakes, responsible data science and Artificial General Intelligence, while listening to an AI-generated Christmas carol!</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*l58-MrTKOqzM_VFA.png" /><figcaption>What happened last December?</figcaption></figure><p>Quick reminder if you’re not familiar with the concept: we’re a deep tech company specialized in Computer Vision based in Paris. Every month we share our <strong>ten favorite AI-related articles </strong>(or other stuff). This is the digest of <strong>December 2019</strong>.</p><p>I know you’re probably not hungry after the Holidays but here’s the article’s menu: better and stronger <strong>deep fakes</strong>, socially and environmentally <strong>responsible data science</strong>, and <strong>Artificial General Intelligence</strong>. New decade, time to adapt, I’m kicking off with a meme instead of a comic. If you’re a data scientist under 30 years old and don’t get it, I’d love your feedback.</p><figure><img alt="Sigmoid car v.s. ReLU car" src="https://cdn-images-1.medium.com/max/548/0*P_l5kKCE9TLZYAHz.png" /><figcaption>Bottom one is Tesla’s new “Cybertruck” (<a href="https://www.tesla.com/cybertruck">order now</a> if you feel like you just have too much money)</figcaption></figure><h4>An AI-generated Christmas Song</h4><figure><img alt="The AI XMAS song generated with GPT-2" src="https://cdn-images-1.medium.com/max/334/0*x8yta8rPp-Y5G_eE.png" /><figcaption>The AI XMAS song generated with GPT-2</figcaption></figure><p>I’m going to open up this Best Of AI with <a href="https://soundcloud.com/theforevernow/ai-xmas-rudolph-the-red-nosed-king-of-all-earth">the song that has been stuck in my head for a few weeks</a>. It’s <strong>a Christmas song</strong> and — brace yourself! — it’s not <a href="https://www.youtube.com/watch?v=fGFNmEOntFA">Mariah Carey</a>. Interpretation is from a musician in Denmark. <strong>Lyrics are from a neural network.</strong></p><p>Research scientist Janelle Shane trained a <a href="https://openai.com/blog/better-language-models/">GPT-2</a> on 240 Christmas carols. Then she asked her model to produce a song about Rudolph the Red-Nosed Reindeer. <a href="https://tmblr.co/ZP7VLs2mph0RM">The result</a> is brilliantly terrible.</p><p>The interpretation is so pure that my family didn’t notice anything weird when I played this at Christmas dinner. Don’t hesitate to use this as background music while you keep reading!</p><h4>What happened in the field in 2019?</h4><figure><img alt="Time for an AI-rewind" src="https://cdn-images-1.medium.com/max/782/0*cWwXWMYEbbYHZHb-.jpg" /><figcaption>Let’s wrap up what happend in 2019</figcaption></figure><p>2019 is over and I reckon it’s healthy to <strong>look back at what happened</strong> this year before jumping into a new year. Former Research Director at Netflix Xavier Amatriain can help you with that. <a href="https://medium.com/@xamat/the-year-in-ai-2019-ml-ai-advances-recap-c6cc1d902d5">His review</a> gives a nice perspective of this year’s <strong>achievements and challenges in Machine Learning</strong>.</p><h4>StyleGAN2: the revenge of deep fakes</h4><figure><img alt="Fluid transition between fake faces" src="https://cdn-images-1.medium.com/max/720/0*4S6swDAeCEg8SH34.gif" /><figcaption>Smooth interpolation between StyleGAN2’s outputs (full video)</figcaption></figure><p>Last year, Nvidia created a huge sensation when they introduced <strong>StyleGAN</strong> (the generative algorithm behind <a href="https://thispersondoesnotexist.com/">thispersondoesnotexist.com</a>). Some people were like “awesome! we can do that now?” and others feared that it could be used in <a href="https://www.theguardian.com/technology/ng-interactive/2019/jun/22/the-rise-of-the-deepfake-and-the-threat-to-democracy">a very malicious way</a>.</p><p>Apparently, it was not enough, so they just made <strong>StyleGAN2</strong>. They fixed some flaws of the first version (generated images often had water droplets on the background; now they don’t). They also added some psychedelic new features like a <strong>smooth latent space</strong>, which is responsible for the animation above.</p><p>You can find the full paper <a href="http://arxiv.org/abs/1912.04958">here</a> and the code <a href="https://github.com/NVlabs/stylegan2">there</a>. <a href="https://towardsdatascience.com/stylegan2-ace6d3da405d">Here</a> is a blog post explaining all the changes.</p><h4>Deploy models to production without unfair bias</h4><figure><img alt="Sketch of Human Bias in an ML pipeline" src="https://cdn-images-1.medium.com/max/640/0*4mduVyAeMlqmd_N3.png" /><figcaption>So basically a Machine Learning pipeline comes with a lot of human bias (<a href="https://ai.googleblog.com/2019/12/fairness-indicators-scalable.html">credit</a>)</figcaption></figure><p>Machine Learning might be fun, but we have to keep in mind that it is not a game. The code we write <strong>influences people’s lives</strong>. Our algorithms learn from the example we provide them. As such, they are prone to replicate every <strong>unfair bias</strong> present in the data. For instance, gender or race can become a factor in credit decision or resume classification <a href="https://towardsdatascience.com/preventing-machine-learning-bias-d01adfe9f1fa">(even if it’s not an explicit input)</a>.</p><p>We want our algorithms to be better than us, not to reproduce our mistakes. Following this idea, Google released <a href="https://ai.googleblog.com/2019/12/fairness-indicators-scalable.html">Fairness Indicators</a> this month. It’s a suite of tools built on TensorFlow to help data scientists<strong> diagnose unfair biases in their model</strong>. A good step in the right direction!</p><h4>Machine Unlearning</h4><figure><img alt="An eraser" src="https://cdn-images-1.medium.com/max/1024/0*P-IpvVM0nvGN_sCM.jpeg" /><figcaption>How does a neural network forget data?</figcaption></figure><p>Just like people whose data is treated by AI algorithms have the right to know that they were not affected by unfair bias, they also have the right to <strong>ask for their data to be deleted</strong>. The problem is that any model trained with their data may <strong>have it memorized</strong>, and it would be incredibly expensive to re-train all your models every time a data instance is removed.</p><p>Researchers from the <a href="https://www.utoronto.ca/">University of Toronto</a> introduce a new way to train deep neural networks so that they can <strong><em>unlearn</em> more easily</strong> when a data instance is removed from the training set. A great step towards total <a href="https://eur-lex.europa.eu/eli/reg/2016/679/oj"><strong>GDPR</strong></a><strong> compliance</strong>, and it seems that it can also be used when some examples simply become irrelevant. You can find their article <a href="https://arxiv.org/abs/1912.03817">on arXiv</a>.</p><h4>What’s the environmental impact of your model?</h4><figure><img alt="A lot of pollution" src="https://cdn-images-1.medium.com/max/1024/0*Ym_0Cv7IM_2g2bNr.jpeg" /><figcaption>Training a model has as much effect as a transatlantic flight</figcaption></figure><p>Another responsibility as a data scientist is towards the environment. GPU computing has a <strong>big environmental impact</strong>: training state-of-the-art Deep Learning models now take up to years of computing time (of course they are distributed in many units, so we have them up-and-ready in a matter of days).</p><p>Following their recent <a href="https://arxiv.org/abs/1910.09700">paper</a> on quantifying carbon emissions of Machine Learning, a team from Montreal published a website on which <strong>you can evaluate your own emissions</strong> in three clicks! The core idea is to include this information in future research papers so that we no longer ignore the environmental impact of our work.</p><p>For fun, I wanted to see what it took to train GPT-2 (the model that produced our beautiful Christmas song). One training process produces <strong>as much CO2 as a round trip between Paris and Los Angeles</strong>. You can only imagine the cost of the <a href="https://www.sicara.ai/blog/2019-14-07-determine-network-hyper-parameters-with-bayesian-optimization">hyper-parameter tuning</a>!</p><h4>Solving differential equations with a neural network</h4><figure><img alt="A differential equation that is now easily solvable using AI" src="https://cdn-images-1.medium.com/max/1024/0*Q0jojpUMA_fKjRop.jpeg" /><figcaption>Neural nets to solve math equations</figcaption></figure><p>To be honest, when I first read that there was now a neural network that can <strong>solve differential equations</strong> or <strong>calculate integrals</strong>, I didn’t care a bit. I just assumed this was something conventional solvers had been doing for decades, so why bother? Turns out I was wrong and it’s really a big deal: deterministic state of the art only reached 85% accuracy on function integration, but this new solution is close to 100%, with <strong>an inference time under one second</strong>!</p><p>The research paper is <a href="https://arxiv.org/abs/1912.01412">here</a>, and <a href="https://www.technologyreview.com/s/614929/facebook-has-a-neural-network-that-can-do-advanced-math/">here</a> is an article in the MIT Technology Review that summarizes it very well.</p><h4>NeurIPS 2019 Keynote: The future of Deep Learning According to Yoshua</h4><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2FT3sxeTgT4qc%3Ffeature%3Doembed&amp;url=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DT3sxeTgT4qc&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2FT3sxeTgT4qc%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" width="854" height="480" frameborder="0" scrolling="no"><a href="https://medium.com/media/850f064cf7d13560c7fa0665a7304128/href">https://medium.com/media/850f064cf7d13560c7fa0665a7304128/href</a></iframe><p>It’s hard to imagine the best of AI this month without mentioning <a href="https://nips.cc/">the NeurIPS conference</a>. There were a lot of interesting talks on current and future challenges of Machine Learning. I can’t mention them all, so I’ll focus on the one that impressed me the most.</p><p>Founding father of Deep Learning <strong>Yoshua Bengio</strong> talked about the <strong>System 2 Deep Learning</strong> paradigm: what current Deep Learning is missing to <strong>match human intelligence;</strong> promising leads on how to face the challenges of compositionality, causality and out-of-distribution generalization. One of these leads is, of course, <strong>meta-learning</strong>, <a href="https://www.sicara.ai/blog/2019-07-30-image-classification-few-shot-meta-learning">which has been a research interest at Sicara for some time now</a>.</p><p>If you missed it, I can but strongly advise you to rectify this as soon as you get an hour of free time. If you don’t have an hour, the 12 first minutes may suffice as an introduction to these challenges that will surely shape the future of AI!</p><h4>We need a better measure of intelligence than chess and Starcraft</h4><figure><img alt="François CHollet and Sicara&#39;s data scientists" src="https://cdn-images-1.medium.com/max/1024/0*KDd5QujenW4oDGlu.png" /><figcaption>Creator of Keras François Chollet (4th from the left) with our team at Sicara</figcaption></figure><p>As a scientist, what thrills me the most in the field of Machine Learning is Artificial General Intelligence (AGI). It refers to <strong>an AI that can learn any task that a human can</strong>. And it is all that current AI isn’t, as Yoshua Bengio explained.</p><p><a href="https://www.theverge.com/2019/12/19/21029605/artificial-intelligence-ai-progress-measurement-benchmarks-interview-francois-chollet-google?fbclid=IwAR0jI4MQVdcVjHjlOPsd24uJe9oA_4cYaiXWthVmzydOtCwfItX-MEAcIcY">This article</a> is an interview with researcher François Chollet. This is about <strong>how we measure intelligence</strong>. And why we need to find <strong>better benchmarks than video games</strong> or board games, if we want to do more than designing AI that harness millions of examples and thousands of years of computing time to learn one specific task.</p><h4>ObjectNet: the proof that you’re smarter than a CNN</h4><figure><img alt="Left: oven gloves on a bed. Right: a hammer on a hand." src="https://cdn-images-1.medium.com/max/1024/0*pLqSZqiwwdOrtC0C.png" /><figcaption>Examples of images in ObjectNet</figcaption></figure><p>A perfect example of the<strong> terrible generalization abilities</strong> of our Machine Learning algorithms was provided this month by MIT and IBM researchers. They spent three years designing <strong>ObjectNet</strong>. This dataset is like <strong>a parody of ImageNet</strong>, where objects are taken out of context or in odd positions, and shot at random angles. They used it to test object detectors trained on ImageNet, and — surprise! — the accuracy was<strong> cut in half</strong>. <a href="http://news.mit.edu/2019/object-recognition-dataset-stumped-worlds-best-computer-vision-models-1210">Here</a> is the article on MIT News explaining everything.</p><p>I just love this. Seeing state-of-the-art algorithms trained for weeks on millions of images fail to recognize a hammer because it’s <em>on</em> a hand and not <em>in</em> a hand. It shows us how much progress we still have to make.</p><p>That’s it for December and subsequently for 2019. Now 2020 will be what you make it. Do you need data science services for your business? Do you want to apply for a data science job at Sicara? Feel free to <a href="https://www.sicara.ai/contact-us">contact us</a>, we would be glad to welcome you in our Paris office.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=49092afcf41" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/the-best-of-ai-articles-published-in-december-2019-49092afcf41">The Best of AI Articles Published in December 2019</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[The Best of AI: New Articles Published This Month (November 2019)]]></title>
            <link>https://medium.com/sicara/the-best-of-ai-new-articles-published-this-month-november-2019-5f9699ff26f6?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/5f9699ff26f6</guid>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[artificial-intelligence]]></category>
            <category><![CDATA[data-science]]></category>
            <category><![CDATA[best-of]]></category>
            <category><![CDATA[articles]]></category>
            <dc:creator><![CDATA[Jean]]></dc:creator>
            <pubDate>Thu, 12 Dec 2019 11:53:25 GMT</pubDate>
            <atom:updated>2019-12-12T13:17:33.368Z</atom:updated>
            <content:encoded><![CDATA[<p><em>10 data articles handpicked by the Sicara team, just for you</em></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*sG48SRw1vRkzls8c" /></figure><p>Welcome to the November edition of our best and favorite articles in AI that were published this month. We are a Paris-based company that does Agile data development.</p><p>This month, we spotted articles about AI that can identify who wrote each scene in Shakespeare’s Henry VIII, and teach non-native speakers how to pronounce English words! Let’s start, as usual, with the comic of the month:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/290/0*PuC2ioztrZv7Lz2m" /></figure><h3>1 — Predict Impact of a Song on our emotions</h3><p><em>Machine Learning, Music and Emotions</em></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/696/0*b1RIFXh_3ASEqC5C" /><figcaption>Man playing guitar near trees and and body of water — Priscilla Du Preez</figcaption></figure><p>In a <a href="https://dl.acm.org/citation.cfm?id=3343031.3350867">recent article</a> researchers describe how they trained machine-learning algorithms to predict what features in a song would impact people’s emotional responses.</p><p>They predicted brain and heart activities as well as physiological response using features based on music dynamics such as timbre, harmony, etc…</p><p>This work helps to understand how music affects human experience and has applications in music emotion recognition and neuroscience.</p><h3>2 — A Mobile App to Improve English Pronunciation of Non-Native Speakers</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/692/0*w7bkOCj9xDaod2gt" /><figcaption>Credit: CC0 Public Domain</figcaption></figure><p>How to improve your English pronunciation if — like me — you do not always understand why your sentence is wrongly enunciated? A <a href="https://phys.org/news/2019-10-bellevue-startup-artificial-intelligence-english.html">startup used machine learning to tackle this challenge!</a> <a href="https://www.bluecanoelearning.com/">Blue Canoe</a> created a mobile app directing its users to repeat sentence prompts. Speech-recognition technology then analyzes the recordings and uses machine-learning models to point out the differences. When users spend 10 minutes per day on the app, personalized feedback informs students precisely how they mispronounced words. The startup started by digitizing a 20-year-old methodology called the <a href="https://americanenglish.state.gov/resources/color-vowel-chart">Color Vowel System</a>. Then, they hired linguists to listen to users’ recordings and tag the problems. Recordings are then used to improve machine-learning models.</p><h3>3 — Drones to spot missing people</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/693/0*dFH8kmwiQkm-V3N4" /><figcaption>Flying a drone at dusk in the city — Goh Rhy Yan</figcaption></figure><p>While California was last month the third state to <a href="https://securitytoday.com/articles/2019/10/10/california-to-become-third-state-to-ban-facial-recognition-software-in-police-body-cameras.aspx">frame Police use of facial recognition softwares</a>, Police Scotland unveiled this month a new drone using computer vision to search missing and vulnerable people reported <a href="https://www.bbc.com/news/uk-scotland-50262650">BBC</a>. Its recognition software is lightweight enough to be used on a smartphone and uses an optical camera and a sensor detecting heat. Police Scotland’s air support unit detailed aspects of its drone to argue it will not be used to spy citizens: ”We’ll comply fully with all the human rights legislation — in fact a data protection impact assessment has been carried out and we review that yearly. Also, before we deploy we’ll use social media to tell the public this is what we’re doing. ”In addition, its blue light and the sound of its rotors are supposed to alert people of its presence, believes <a href="https://www.bbc.com/news/uk-scotland-50262650">BBC</a>.</p><h3>4 — Video recognition by Facebook</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/600/0*m8RsU-xrAl8thTjl" /><figcaption>Facebook’s SlowFast classifying a video. Image Credit: Facebook</figcaption></figure><p><a href="https://venturebeat.com/2019/11/04/facebooks-slowfast-video-classifier-ai-was-inspired-by-primate-eyes/">PySlowFast</a> — Facebook’s video recognition system — is now available on <a href="https://github.com/facebookresearch/SlowFast">GitHub</a> and its mechanisms explained in a <a href="https://arxiv.org/pdf/1812.03982.pdf">preprint paper</a>. The main intuition of this system is to reproduce primate’s eye cells. These cells are either functioning at low frequency and focusing on fine details either responding to swift changes. Transposed to this video’s recognition system: the video is treated at a low and at a higher temporal rate. The lower to recognize static areas and the higher to recognize dynamic areas. This model has been confronted to two popular datasets: DeepMind’s Kinetics-400 and Google’s AVA and achieved state-of-the-art results on both.</p><h3>5 — Full Release of Controversial GPT-2 Text Generating AI</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/693/0*1Q5MT2eOA2_SycxD" /></figure><p>Last November 5, OpenAi finally <a href="https://openai.com/blog/gpt-2-1-5b-release/">released</a> the largest version of its controversial model GPT-2, claiming they have not found “strong evidence of misuse so far”. GPT-2 is a deep learning model able to output credible text from a minimal prompt (demo <a href="https://talktotransformer.com/">here</a>). This full version was not originally released last February because OpenAi was concerned it could be used to automatically produce Fake News (summary of the debate <a href="https://www.theverge.com/2019/2/21/18234500/ai-ethics-debate-researchers-harmful-programs-openai">here</a>). They motivated this late release by the following arguments:</p><ul><li>this model version has only a marginally greater “credibility score” compared to already released version (according to a survey by Cornell University).</li><li>they acknowledge that “GPT-2 can be fine-tuned for misuse” but argue that “despite having low detection accuracy on synthetic outputs, ML-based detection methods can give experts reasonable suspicion that an actor is generating synthetic text”</li><li>they conducted in-house detection research and developed a that has detection rates of ~95% for detecting 1.5B GPT-2-generated text. By releasing this version they aim “to aid the study of research into the detection of synthetic text, although this does let adversaries with access better evade detection”.</li></ul><h3>6 — French BERT (CamemBERT) now available!</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/690/0*0iWEmCWLqabG95oA" /></figure><p>The French version of BERT has been released on <a href="https://github.com/huggingface/transformers">huggingface/transformers repo</a>! BERT or Bidirectional Encoder Representations from Transformers is a method based on pre-training language representations which obtained state-of-the-art results on a wide array of Natural Language Processing tasks (Google explanations <a href="https://ai.googleblog.com/2018/11/open-sourcing-bert-state-of-art-pre.html">here</a>). This French version has been trained on 138 GB of French text and is available both in Pytorch and Tensorflow 2. This release is the achievement of a collaboration between Facebook AI, <a href="https://www.inria.fr/en/">INRIA</a> and <a href="https://www.sorbonne-universite.fr/">Sorbonne Université</a>.</p><h3>7 — Recommending Apps in Google Play Store</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/692/0*a8XYS5o6dgfSh0C-" /></figure><p>In an interesting <a href="https://deepmind.com/blog/article/Advanced-machine-learning-helps-Play-Store-users-discover-personalised-apps">blog post</a>, Deepmind explained its approaches implementing recommendation algorithms for Google Play Store, in order to “help users discover personalized apps”. The first approach using LSTM (Neural Network used to treat sequences) has been replaced by Transformers, which improved the model performance, but also increased the training cost. Third and final solution was to implement “an efficient additive attention model that works for any combination of sequence features, while incurring low computational cost”. In addition, the blog post introduced recommendation bias problem and how they deal with it: “For instance, if app A is shown in the Play Store 10 times more than app B, it’s more likely to be installed by the user, and thus more likely to be recommended by our model”. They detailed refinements they introduced in re-ranking recommendations and optimizing for multiple objective, such as relevance, popularity, or personal preferences.</p><h3>8 — Determine Who Wrote each Shakespeare’s Henry VIII. Scenes</h3><p><em>A new approach on a century-lasting debate!</em></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/694/0*tUFh-JJnjFBdh-oW" /><figcaption>Theater, Kuala Lumpur — Gwen Ong</figcaption></figure><p>Some literary analysts believe that Shakespeare did not write his play Henry VIII alone but has been helped by John Fletcher, the writer who replaced him as playwright of the <a href="https://en.wikipedia.org/wiki/King%27s_Men_%28playing_company%29">King’s Men</a> after his dead. In the mid-nineteenth century, literary analyst <a href="https://en.wikipedia.org/wiki/James_Spedding">James Spedding</a> already proposed a division based on the use of eleven-syllable lines. In 1962, an influential analyst divided the play between Shakespeare and Fletcher based on their distinctive word choices, for example Fletcher’s uses of ye for you and ’em for them. And last month, Petr Plecháč of the Czech Academy of Sciences in Prague claimed he has studied the problem using machine learning to identify the authorship at a more accurate level (not only attributing scenes): “Our results highly support the canonical division of the play between William Shakespeare and John Fletcher proposed by James Spedding”.</p><h3>9 — Increase Solar Panel Energy Production</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/694/0*shb5gHljzln5nsyX" /><figcaption>Solar Panels — Andreas Gucklhorn</figcaption></figure><p>A startup named Heliogen aims to <a href="https://earther.gizmodo.com/bill-gates-backed-startup-uses-ai-to-create-solar-rays-1839946842">increase solar panel energy production</a> by using advanced computer vision software. Such technology’s impact should not be limited to increase energy production. By accurately aligning mirrors Heliogen expects to be able to reach temperatures over 1,000 degrees Celsius. Such high temperatures could be used for the industrial applications that currently account for roughly 75 percent of the energy demand through fossil fuel production. In addition, this technology could ultimately provide an alternative to gasoline for powering automobiles by “spliting carbon dioxide and water molecules to produce clean-burning fuels like hydrogen”, the article explains. This AI-backed technology could therefore be a step to successfully use solar energy in fields where still dependent to fossil fuel.</p><h3>10 — Self-training with Noisy Student improves ImageNet classification</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/726/0*kVO-Sw89beFclAXR" /><figcaption><a href="https://arxiv.org/abs/1911.04252">Self-training with Noisy Student improves ImageNet classification</a></figcaption></figure><p>In an <a href="https://arxiv.org/abs/1911.04252">article</a> submitted last November 11, three researchers explained how they obtained 87.4% top-1 accuracy on ImageNet, which is 1.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images. <a href="http://www.image-net.org/">ImageNet</a> is a famous image database often used to measure the performance of image classification neural networks. To achieve this result, they first trained an EfficientNet on labeled ImageNet images and use it to label 300M unlabeled images (it creates pseudo-labels, as these labels are not the ground-truth but a prediction). This first EfficientNet is called the Teacher. Then they trained a larger EfficientNet — called the student — learning to classify both ImageNet and newly labeled images. They iterate this process by using the larger EfficientNet as Teacher, i.e to re-label the dataset of 300M unlabeled images. During the learning of the student, they injected noise such as data augmentation, dropout, stochastic depth to the student so that the student neural network is forced to learn harder from the pseudo labels. But during the pseudo-labelling of the 300M unlabelled images, the teacher is not noised so that the pseudo labels are as good as possible. These researchers stressed that the “main difference between [their] work and prior works is that [they] identify the importance of noise, and aggressively inject noise to make the student better”. The following results show this impact of noise on the network’s results:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/313/0*_Y3XM7SnbxRAurl5" /><figcaption>Ablation study on noising — <a href="https://arxiv.org/abs/1911.04252">Self-training with Noisy Student improves ImageNet classification</a></figcaption></figure><p>Do you need data science services for your business? Do you want to apply for a data science job at Sicara? Feel free to <a href="https://www.sicara.ai/contact-us">contact us</a>, we would be glad to welcome you in our Paris office</p><p><em>This article was originally published on Sicara’s blog: <br></em><a href="https://www.sicara.ai/blog/11-2019-best-of-ai-november-2019">https://www.sicara.ai/blog/11-2019-best-of-ai-november-2019</a></p><p><a href="https://www.sicara.ai/blog/10-2019-best-of-ai-october-2019">Read the October edition</a><a href="https://www.sicara.ai/blog/2019-10-21-best-of-ai-september-2019"><br>Read the September edition</a><br><a href="https://www.sicara.ai/blog/2019-08-12-best-ai-july-2019">Read the July edition</a><br><a href="https://www.sicara.ai/blog/2019-07-11-best-ai-june-2019">Read the June edition</a></p><p>Some articles we recently published on our blog:</p><p><a href="https://www.sicara.ai/blog/2019-09-26-face-detectors-dsfd-state-of-the-art-algorithms">Face Detectors: Understand DSFD and the State-of-the-art Algorithms</a><br><a href="http://determine%20your%20network%20hyper-parameters%20with%20bayesian%20optimization/">Determine Your Network Hyper-parameters With Bayesian Optimization</a><br><a href="https://www.sicara.ai/blog/2019-28-10-deep-learning-memory-usage-and-pytorch-optimization-tricks">Deep Learning Memory Usage and Pytorch Optimization Tricks</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=5f9699ff26f6" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/the-best-of-ai-new-articles-published-this-month-november-2019-5f9699ff26f6">The Best of AI: New Articles Published This Month (November 2019)</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[The Best of AI: New Articles Published This Month (October 2019)]]></title>
            <link>https://medium.com/sicara/10-2019-best-ai-new-articles-this-month-fa5ef17c89ba?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/fa5ef17c89ba</guid>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[articles]]></category>
            <category><![CDATA[ai]]></category>
            <category><![CDATA[best-of]]></category>
            <category><![CDATA[data-science]]></category>
            <dc:creator><![CDATA[Maria Romanenko]]></dc:creator>
            <pubDate>Tue, 19 Nov 2019 10:46:18 GMT</pubDate>
            <atom:updated>2019-12-13T16:49:42.094Z</atom:updated>
            <content:encoded><![CDATA[<h4>10 data articles handpicked by the Sicara team, just for you</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*jn7qheByeQWLJpfV7Wpp6A.png" /></figure><p>Read the original article on Sicara’s blog <a href="https://www.sicara.ai/blog/10-2019-best-of-ai-october-2019">here</a>.</p><p>Welcome to the <strong>October</strong> edition of our <strong>best and favorite articles in AI</strong> that were published this month. We are a Paris-based company that does <strong>Agile data development</strong>. This month, we spotted articles about AI that can solve physics problems, paint portraits, judge criminals, play video games and even recognize smells! Let’s start, as usual, with the <strong>comic of the month</strong>:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/650/1*8ikHUD0KXNVh7h-AwD6Hxg.jpeg" /><figcaption>Source: <a href="http://www.commitstrip.com/en/2019/09/06/meanwhile-in-a-parallel-universe-5/">http://www.commitstrip.com/en/2019/09/06/meanwhile-in-a-parallel-universe-5/</a></figcaption></figure><h3>1 — AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*zFztjkPZC2c_SKR8neza3A.png" /><figcaption>Source: <a href="https://youtu.be/KPLYhRBCcvk">https://youtu.be/KPLYhRBCcvk</a></figcaption></figure><p>The DeepMind’s bot <strong>AlphaStar</strong> managed to enter the <strong>Grandmaster league</strong> in <strong>Starcraft II</strong>. This league is the highest of the seven ranked leagues of the game. The developers made three different versions of the agent play against real players<strong> </strong>on Battle.net. The most advanced version got ranked, on average, in the <strong>top 0.15% of all players</strong>.</p><p>By the way, if you want to develop <strong>your own Starcraft II bot</strong>, you can, just like DeepMind, use <a href="https://github.com/Blizzard/s2client-proto">the official Blizzard’s API client</a> that provides full external control of the game. If you want to take a look at the official research paper, together with DeepMind’s pseudocode, detailed architecture and datasets with game replays<strong> </strong>to train your bots, they are available <a href="https://www.nature.com/articles/s41586-019-1724-z">here</a>.</p><p>Read <a href="https://deepmind.com/blog/article/AlphaStar-Grandmaster-level-in-StarCraft-II-using-multi-agent-reinforcement-learning">AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning</a> — From <a href="https://deepmind.com/blog/">DeepMind Blog</a>.</p><h3>2 — Solving Rubik’s Cube with a Robot Hand</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*mAfIDXMVmWkunDufg3efdQ.jpeg" /></figure><p><strong>OpenAI</strong> <strong>trained</strong> <strong>a robot hand</strong> that is capable of manipulating a Rubik’s cube. To be clear, this achievement is not really about solving the cube<strong>,</strong> but rather about developing an extremely dexterous agent able to perform interactions with the environment with a high degree of precision.</p><p>The whole training process took place in a simulated environment. The new method, called <strong>Automatic Domain Randomisation</strong>, generated harder and harder environments as the agent trained. For more challenge, the developers introduced various <strong>perturbations</strong> <strong>during the</strong> <strong>robustness</strong> <strong>tests </strong>with a real robot hand. My favorite one is a <a href="https://youtu.be/QyJGXc9WeNo?t=72">cute plush giraffe</a> curiously poking the cube with the tip of its nose! Others include throwing a blanket on the robot hand or making it solve the cube while wearing a rubber glove.</p><p>Read <a href="https://openai.com/blog/solving-rubiks-cube/">Solving Rubik’s Cube with a Robot Hand</a> — From <a href="https://openai.com/blog/">OpenAI Blog</a>.</p><h3>3 — Loss landscape</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*npUHVBc28Wy1L1_vR3j3kw.jpeg" /><figcaption>Source: <a href="https://losslandscape.com/">https://losslandscape.com</a></figcaption></figure><p>Neural nets are ubiquitous. But what really happens inside of them? This remains a mystery even to their developers. This new project takes you on a journey to a mesmerizing world of weirdly satisfying loss landscapes. Some of the visualizations produced by the Loss Landscape project are:</p><ul><li><strong>LR Coaster</strong> that lets you ride along the minimizer during the learning rate stress test,</li><li><strong>Sentinel</strong> that explores the optimization process of a convolutional net,</li><li><strong>WALTZ-RES</strong> that shows the difference between two ResNet networks, with and without skip connections,</li></ul><p>and many more!</p><p>Visit <a href="https://losslandscape.com">Loss Landscape</a> — By <a href="https://ideami.com/">Javier Ideami</a>.</p><h3>4 — Turn Python Scripts into Beautiful ML Tools</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/700/1*Vl_CSZlK9Te1v9wDeZEYPw.gif" /><figcaption>Source: <a href="https://github.com/streamlit/demo-self-driving">https://github.com/streamlit/demo-self-driving</a></figcaption></figure><p><strong>Streamlit</strong> is a new open-source Python framework built for machine learning engineers. As the developers promise on their <a href="https://streamlit.io">website</a>, it is “<strong>The fastest way to build custom ML tools</strong>”. Using Streamlit, you can build sleek web apps to serve your models in<strong> just a few lines of Python code!</strong></p><p>Here are the core principles of the framework<strong>:</strong></p><ul><li><strong>Scripts are awesome</strong>: every Streamlit app is a stateless Python script.</li><li><strong>No</strong> <strong>callbacks</strong>: every widget is a variable<strong>!</strong></li><li><strong>Information reuse</strong>: data and computations are cached in Streamlit’s data store that lets it safely persist information<strong>.</strong></li></ul><p><a href="https://streamlit.io/docs/getting_started.html">Try it now</a> and see for yourself!</p><p>Read <a href="https://towardsdatascience.com/coding-ml-tools-like-you-code-ml-models-ddba3357eace">Turn Python Scripts into Beautiful ML Tools</a> — From <a href="https://towardsdatascience.com/">Towards Data Science</a></p><h3>5 — Can you make AI fairer than a judge? Play our courtroom algorithm game</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/639/1*wZyhfLPKtbPEWtS-NfoYfg.png" /><figcaption>Source: <a href="https://futurama.fandom.com/wiki/Judge_723">https://futurama.fandom.com/wiki/Judge_723</a></figcaption></figure><p><strong>COMPAS</strong> is an algorithm used in the <strong>US courts</strong>. It looks at the defendant’s criminal history and outputs a “<strong>risk score</strong>”. This score reflects how likely the person under trial is to become a recidivist.</p><p>It turned out that the algorithm is <strong>racially biased</strong>, even though the score doesn’t take race into account. This piece lets you <strong>tweak</strong> the algorithm’s parameters and make it fairer!</p><p>Read <a href="https://www.technologyreview.com/s/613508/ai-fairer-than-judge-criminal-risk-assessment-algorithm/">Can you make AI fairer than a judge? Play our courtroom algorithm game</a> — By <strong>MIT Technology Review</strong></p><h3>6 — A neural net solves the three-body problem 100 million times faster</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/300/1*0wF_vX4Xbd-iGbna8f-2sw.gif" /><figcaption>Source: <a href="https://en.wikipedia.org/wiki/Three-body_problem">https://en.wikipedia.org/wiki/Three-body_problem</a></figcaption></figure><p><strong>The three-body problem </strong>is a classic physics problem of calculating the trajectories of three bodies given their initial positions and velocities<strong>. </strong>The first specific version of this problem, formulated in the 17th century, involved calculating the motion of the Earth, the Sun and the Moon.</p><p>It turned out to be an <strong>extremely hard problem to solve</strong>, since the resulting dynamic system is chaotic, except for a small number of edge cases. So far, a closed-form solution to this problem has not been found. Therefore, the solutions are generally calculated numerically, requiring enormous computational resources.</p><p>Researchers from the University of Edinburgh <strong>trained a neural network</strong> on the solutions produced by the state-of-the-art solver<strong> </strong>named<strong> </strong>Brutus<strong>. </strong>As a result, this network is able to <strong>accurately predict the motion</strong> of three bodies <strong>up to 100 million times faster</strong> than the solver.</p><p>Read <a href="https://www.technologyreview.com/s/614597/a-neural-net-solves-the-three-body-problem-100-million-times-faster/">A neural net solves the three-body problem 100 million times faster</a> — From <a href="https://www.technologyreview.com/">MIT Technology Review</a></p><h3>7 — Learning to Smell: Using Deep Learning to Predict the Olfactory Properties of Molecules</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/810/1*1yjODNONtzSISgFI_dp0Ug.jpeg" /><figcaption>Source: <a href="https://www.jedidefender.com/yabbse/index.php?topic=20503.0">https://www.jedidefender.com/yabbse/index.php?topic=20503.0</a></figcaption></figure><p>We’re no longer surprised by AI models that can see and hear things. But what about other senses? <strong>Google</strong> came up with a model that is able to figure out how different things smell<strong> </strong>by predicting <strong>smell descriptors from molecules</strong>. It can distinguish smells like vanilla, chocolate or citrus, but also more complicated ones such as spicy, beefy or creamy.</p><p>Further research in this area could make it possible to <strong>develop digital scents</strong> and to <strong>create molecules</strong> <strong>with completely new smells</strong>. It would also be incredibly useful to help those who can’t smell appreciate scents like everyone else.</p><p>Read <a href="https://ai.googleblog.com/2019/10/learning-to-smell-using-deep-learning.html">Learning to Smell: Using Deep Learning to Predict the Olfactory Properties of Molecules </a>— From <a href="https://ai.googleblog.com/">Google AI Blog</a></p><h3>8 — Unsupervised Doodling and Painting with Improved SPIRAL</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/256/1*8R36orHHFo03kt25RkeHnQ.jpeg" /><figcaption>Source: <a href="https://learning-to-paint.github.io/">https://learning-to-paint.github.io</a></figcaption></figure><p>You may already be familiar with <a href="https://medium.com/syncedreview/gan-2-0-nvidias-hyperrealistic-face-generator-e3439d33ebaf"><strong>generative adversarial networks</strong></a> that create photorealistic high-resolution images. Human drawings<strong>, </strong>on the other hand<strong>, </strong>are rarely<strong> </strong>photorealistic, and yet we’re able to tell what’s in the picture, which means that they somehow <strong>capture the “essence”</strong> of objects. This “essence” is a high-level representation that incorporates human knowledge and structure.</p><p><strong>SPIRAL++</strong> is a GAN framework that learns how to <strong>paint</strong> <strong>like a human artist</strong>. With a limited number of brush strokes and without supervision, the algorithm learns to draw objects that are clearly recognizable<strong> </strong>by humans. This article lets you click on any image painted by the generator network and see the whole process of its creation, stroke by stroke.</p><p>Read <a href="https://learning-to-paint.github.io">Unsupervised Doodling and Painting with Improved SPIRAL </a>— By <a href="https://arxiv.org/search/cs?searchtype=author&amp;query=Mellor%2C+J+F+J">John F. J. Mellor</a> et al.</p><h3>9 — Visualizing Tensor Operations with Factor Graphs</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*uf4d1mYcAeNx2ZrZxLVNZQ.png" /><figcaption>Source: <a href="https://rajatvd.github.io/Factor-Graphs/">https://rajatvd.github.io/Factor-Graphs/</a></figcaption></figure><p>Have you ever felt lost looking at some formula containing multidimensional <strong>tensor operations</strong> and trying to figure out what it does? You’re not alone. Tensor operations can be<strong> </strong>difficult<strong> </strong>to wrap your head around.</p><p>But don’t be discouraged! Here’s a beautiful technique — called <strong>factor graphs</strong> — that produces <strong>powerful visualizations</strong> and helps us understand what’s happening when we work with multi-dimensional arrays of data.</p><p>Read <a href="https://rajatvd.github.io/Factor-Graphs/">Visualizing Tensor Operations with Factor Graphs</a> — From <a href="https://rajatvd.github.io/">Rajat’s Blog</a></p><h3>10 — Smiles beam and walls blush: Architecture meets AI at Microsoft</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/952/1*0XbRlzQqJx7WHFuclQoBxw.jpeg" /><figcaption>Source: <a href="https://blogs.microsoft.com/ai/ada-artist-in-residence/">https://blogs.microsoft.com/ai/ada-artist-in-residence/</a></figcaption></figure><p>The Microsoft Research <a href="https://www.microsoft.com/en-us/research/group/artist-in-residence/">Artist in Residence</a><strong> </strong>program<strong> </strong>developed <strong>Ada</strong>, the first AI-powered pavilion that can <strong>sense our emotions</strong> and <strong>change its colors and lighting </strong>in response.</p><p>Named after <a href="https://en.wikipedia.org/wiki/Ada_Lovelace">Ada Lovelace</a>, Ada is a two-story photo-luminescent structure created using cutting-edge fabrication techniques such as 3D digital knitting. It is able to pick up on our voice tones, choice of words and facial expressions and use that information to <strong>infer our mood in</strong> <strong>real time</strong>. Whether or not it <a href="https://medium.com/@tdietterich/what-does-it-mean-for-a-machine-to-understand-555485f3ad40">actually understands</a> our feelings, it sure looks fascinating!</p><p>Read <a href="https://blogs.microsoft.com/ai/ada-artist-in-residence/">Smiles beam and walls blush: Architecture meets AI at Microsoft</a> — From <a href="https://blogs.microsoft.com/ai/">Microsoft AI Blog</a></p><p><em>This article was originally published on Sicara’s blog: </em><a href="https://www.sicara.ai/blog/10-2019-best-of-ai-october-2019"><em>https://www.sicara.ai/blog/10-2019-best-of-ai-october-2019</em></a></p><p><a href="https://www.sicara.ai/blog/2019-10-21-best-of-ai-september-2019">Read the September edition</a><br><a href="https://www.sicara.ai/blog/2019-08-12-best-ai-july-2019">Read the July edition</a><br><a href="https://www.sicara.ai/blog/2019-07-11-best-ai-june-2019">Read the June edition</a></p><p>Some articles we recently published on our blog:</p><p><a href="https://www.sicara.ai/blog/2019-09-26-face-detectors-dsfd-state-of-the-art-algorithms">Face Detectors: Understand DSFD and the State-of-the-art Algorithms</a><br><a href="http://Determine Your Network Hyper-parameters With Bayesian Optimization">Determine Your Network Hyper-parameters With Bayesian Optimization</a><br><a href="https://www.sicara.ai/blog/2019-28-10-deep-learning-memory-usage-and-pytorch-optimization-tricks">Deep Learning Memory Usage and Pytorch Optimization Tricks</a></p><p><em>Thanks to Hugo L., Fatima K. and Raphaël M.</em></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=fa5ef17c89ba" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/10-2019-best-ai-new-articles-this-month-fa5ef17c89ba">The Best of AI: New Articles Published This Month (October 2019)</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Deep Learning Memory Usage and Pytorch optimization tricks]]></title>
            <link>https://medium.com/sicara/deep-learning-memory-usage-and-pytorch-optimization-tricks-e9cab0ead93?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/e9cab0ead93</guid>
            <category><![CDATA[optimization]]></category>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[pytorch]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[memory-usage]]></category>
            <dc:creator><![CDATA[Quentin Febvre]]></dc:creator>
            <pubDate>Tue, 29 Oct 2019 10:59:21 GMT</pubDate>
            <atom:updated>2019-12-13T16:46:23.593Z</atom:updated>
            <content:encoded><![CDATA[<h3>Deep Learning Memory Usage and Pytorch Optimization Tricks</h3><h4>Mixed precision training and gradient checkpointing on a ResNet</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*GW3ixBMbPqIIMAshXVsrSw.png" /></figure><p>Read the original article on Sicara’s blog <a href="https://www.sicara.ai/blog/2019-28-10-deep-learning-memory-usage-and-pytorch-optimization-tricks">here</a>.</p><p>Shedding some light on the causes behind CUDA out of memory ERROR, and an example on how to reduce by 80% your memory footprint with a few lines of code in Pytorch</p><h3>Understanding memory usage in deep learning models training</h3><p>In this first part, I will explain how a deep learning models that use a few hundred MB for its parameters can crash a GPU with more than 10GB of memory during their training !</p><p>So where does this need for memory comes from? Below I present the two main high-level reasons why a deep learning training need to store information:</p><ul><li>information necessary to backpropagate the error (gradients of the activation w.r.t. the loss)</li><li>information necessary to compute the gradient of the model parameters</li></ul><h4>Gradient descent</h4><p>If there is one thing you should take out from this article, it is this:</p><blockquote><strong>As a rule of thumb, each layer with learnable parameters will need to store its input until the backward pass.</strong></blockquote><p>This means that every batchnorm, convolution, dense layer will store its input until it was able to compute the gradient of its parameters.</p><h4>Backpropagation of the gradients and the chain rule</h4><p>Now even some layer without any learnable parameters need to store some data! This is because we need to backpropagate the error back to the input and we do this thanks to the chain rule:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/490/1*93mMuq1M2FFzYQkO50KVfA.png" /><figcaption>Chain rule:(a_i being the activations of the layer i)</figcaption></figure><p>The culprit in this equation is the derivative of the input w.r.t the output. Depending on the layer, it will</p><ul><li>be dependent on the parameters of the layer (dense, convolution…)</li><li>be dependent on nothing (sigmoid activation)</li><li><strong>be dependent on the values of the inputs:</strong> eg MaxPool, ReLU …</li></ul><p>For example, if we take a ReLU activation layer, the minimum information we need is the sign of the input.</p><p>Different implementations can look like:</p><ul><li>We store the whole input layer</li><li>We store a binary mask of the signs (that takes less memory)</li><li>We check if the output is stored by the next layer. If so, we get the sign info from there and we don’t need to store additional data</li><li>Maybe some other smart optimization I haven’t thought of…</li></ul><h4>Example with ResNet18</h4><p>Now let’s take a closer look at a concrete example: The <strong>ResNet18</strong>!</p><p><strong>Continue reading “</strong><a href="https://www.sicara.ai/blog/2019-28-10-deep-learning-memory-usage-and-pytorch-optimization-tricks"><strong>Deep Learning Memory Usage and Pytorch Optimization Tricks</strong></a><strong>”</strong></p><h4>References</h4><p><strong>Memory Usage</strong></p><p><a href="https://www.graphcore.ai/posts/why-is-so-much-memory-needed-for-deep-neural-networks">Why is so much memory needed for deep neural networks?</a></p><p><strong>Gradient Checkpointing</strong></p><ul><li><a href="https://medium.com/tensorflow/fitting-larger-networks-into-memory-583e3c758ff9">Fitting larger networks into memory.</a></li><li><a href="https://qywu.github.io/2019/05/22/explore-gradient-checkpointing.html">Explore Gradient-Checkpointing in PyTorch</a></li></ul><p><strong>Mixed precision training</strong></p><p><a href="https://devblogs.nvidia.com/mixed-precision-training-deep-neural-networks/">Mixed-Precision Training of Deep Neural Networks | NVIDIA Developer Blog</a></p><p>Are you looking for Image Recognition Experts? Don’t hesitate to contact us!</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=e9cab0ead93" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/deep-learning-memory-usage-and-pytorch-optimization-tricks-e9cab0ead93">Deep Learning Memory Usage and Pytorch optimization tricks</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[The Best of AI: New Articles Published This Month (September 2019)]]></title>
            <link>https://medium.com/sicara/best-ai-september-2019-45a1b457cb5d?source=rss----fd4c083fbb93---4</link>
            <guid isPermaLink="false">https://medium.com/p/45a1b457cb5d</guid>
            <category><![CDATA[articles]]></category>
            <category><![CDATA[data-science]]></category>
            <category><![CDATA[ai]]></category>
            <category><![CDATA[best-of]]></category>
            <category><![CDATA[big-data]]></category>
            <dc:creator><![CDATA[Nicolas Fley]]></dc:creator>
            <pubDate>Wed, 23 Oct 2019 10:21:51 GMT</pubDate>
            <atom:updated>2019-12-13T16:45:19.190Z</atom:updated>
            <content:encoded><![CDATA[<h4>10 data articles handpicked by the Sicara team, just for you</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*bD_lyWroTAQ0oHnvGg-t3Q.png" /></figure><p>Read the full article on Sicara’s blog <a href="https://www.sicara.ai/blog/2019-10-21-best-of-ai-september-2019">here</a>.</p><p>Welcome to the September edition of <strong>our best and favorite articles in AI that were published this month</strong>. We are a Paris-based company that does Agile data development. This month, we spotted articles about <strong>AI surveillance, Deepfake, a documentary from the 60s and much more</strong>. Let’s kick off with the comic of the month:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/454/1*Q2zJlhrAxIis5LUVDoxkCA.png" /><figcaption>From <a href="https://xkcd.com/1626/">xkcd</a></figcaption></figure><h3>1–1960 documentary on AI as seen in 1960</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/596/0*nnztSGgTrteMJN7q.jpg" /></figure><p>Let’s jump in <strong>1960</strong>, we are ten years from HAL 9000 and the first personal computers but people are already thinking about the<strong> emergence of Artificial Intelligence</strong>. From the late 1950s to the early 1960s, newspapers were full of articles about it.</p><p>It’s during that time that “<strong>The Thinking Machine</strong>” first aired. Back then we were already thinking about recognizing <strong>handwriting</strong>, <strong>playing games</strong> (checkers) and <strong>telling stories</strong>. This movie feels surprisingly contemporary, I advise you to take a look at it.</p><p>Read <a href="https://www.fastcompany.com/90399709/to-understand-ai-in-2019-watch-this-1960-tv-show">1960 documentary on AI as seen in 1960</a> — By <a href="https://twitter.com/harrymccracken?lang=fr">Harry McCracken</a>.</p><h3>2 — TensorFlow 2.0 is released</h3><figure><img alt="Résultat de recherche d&#39;images pour &quot;tensorflow v2&quot;" src="https://cdn-images-1.medium.com/proxy/1*-QTg-_71YF0SVshMEaKZ_g.png" /></figure><p>Google announced the <strong>final version of Tensorflow 2.0</strong>. It provides a comprehensive ecosystem of tools for developers, companies, and researchers who want to push the <strong>state-of-the-art in machine learning</strong> and <strong>build scalable</strong> ML-powered applications.</p><p>This new version includes :</p><p>. Ability to <strong>run models from any device</strong> (from an iPhone to a server in the cloud)<br>. Up to <strong>3x</strong> <strong>faster training performance</strong><br>. Standard <strong>dataset interface</strong><br>. <strong>Eager execution</strong> as default<br>. Tight <strong>integration of Keras<br></strong>.<strong> </strong>Access to TensorFlow’s <strong>low-level API</strong></p><p>Read <a href="https://medium.com/tensorflow/tensorflow-2-0-is-now-available-57d706c2a9ab">TensorFlow 2.0 is released</a> — By <a href="https://medium.com/@tensorflow">TensorFlow</a>.</p><p>Notice that PyTorch 1.3 has been released the 10th of October.</p><h3><strong>3 — How to make technology work for society ?</strong></h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*BW23wmKZBqgNsElU.jpg" /></figure><p>A lot of articles warn us on the negative impact of Artificial Intelligence over <strong>employment</strong>. MIT has <strong>published a report</strong> which tries to answer an enlightening question: how does <strong>Artificial Intelligence may help</strong> American <strong>build better careers</strong> as technological changes occur?</p><p>It moves the debate describing opportunities to <strong>suppress low-skilled job</strong>. It also makes the <strong>distinction</strong> between <strong>productive innovations</strong> and innovation which only <strong>perform as good as humans</strong> on effortless jobs (they are called <strong>“s<em>o-so technology</em>”)</strong>. For instance, <strong>self-check-out</strong> at pharmacies or supermarkets are making a <strong>less interesting</strong> improvement than an efficient <strong>waste sorting</strong>.</p><p>It ends by <strong>proposing policies</strong> to help AI and human task force be an <strong>efficient combination</strong> rather than a fight for employment. <strong>Take a look</strong> at their blog post if you want to understand the point of view of the MIT on this subject and see their advice.</p><p>Read <a href="http://news.mit.edu/2019/work-future-report-technology-jobs-society-0904">How to make technology work for society</a>— From <a href="http://news.mit.edu">MIT news</a></p><h3>4 — Detecting patient pain</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/762/0*y5QH57MemzT9ItDY.jpeg" /></figure><p>As improvements are done in the field of operating machines only with our <strong>brain</strong>, a team of researcher from MIT and elsewhere has developed a system that <strong>detects patient pain by analyzing brain activity</strong>.</p><p>This huge <strong>progress</strong> could <strong>help doctors diagnose and treat pain</strong> in unconscious and non-communicative patients. The model, based on the hemoglobin oxygenation, also allows generating “personalized” submodels to fit our different perceptions. It has a <strong>87% accuracy</strong> and may soon be <strong>used in hospital</strong>, according to Dr Lopez-Martinez.</p><p>Read <a href="https://news.mit.edu/2019/detecting-pain-levels-brain-signals-0912">Detecting patient pain</a>— From <a href="https://news.mit.edu">MIT News</a></p><h3>5 — Facebook and Microsoft join forces to fight Deepfake</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/0*Hr8xFEpr8zk85aiV.jpeg" /></figure><p>In July we talked about <a href="https://www.sicara.ai/blog/2019-08-12-best-ai-july-2019">the <strong>best and scariest</strong> example of AI-Enabled Deepfake</a>. In September, <strong>Microsoft</strong> and <strong>Facebook</strong> have provided a huge open-source <strong>labelled dataset</strong> to allow collective brainpower to work on this new threat. They are also <strong>funding</strong> this project with more than <strong>$10 million</strong>.</p><p>Read <a href="https://ai.facebook.com/blog/deepfake-detection-challenge/">Facebook and Microsoft join forces to fight Deepfake</a> — From <a href="https://ai.facebook.com/blog">Facebook Blog</a></p><h3>6 — US government overcomes European GDPR law</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*w9WQ-ZIdIAqg2MQb.jpg" /></figure><p>If you live in a European country, you must have heard about General Data Protection Regulation (<strong>GPDR</strong>), a law which aims to <strong>protect users privacy</strong>. It has significantly bolstered consumer rights.<br>The <strong>CLOUD Act</strong>, introduced last year, is now <strong>threatening this law</strong>. It <strong>allows the American</strong> government to retrieve any information saved in a US company datacenter.<br>More and more affairs show that <strong>US government uses this law to overcome the GDPR</strong>, threatening the link between Brussel and Washington.</p><p>From <a href="https://www.european-views.com/2019/09/europe-continues-to-wrestle-with-the-long-arm-of-american-law/">US government overcomes European GDPR law</a> — By <a href="https://www.european-views.com/author/european-views/">European Views</a></p><h3>7 — Google Fined for targetting children</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/600/0*hL_pRReHUeleMy31.jpg" /></figure><p>Did you know that one of the <strong>most popular</strong> <strong>YouTube channels</strong> in the US is a<strong> kid channel</strong>? During the past year, this type of channels had an exponential growth. This September YouTube has been <strong>fined $170m</strong> to settle allegations it <strong>collected children’s data</strong> without their parents’ consent.</p><p>It’s important to understand that the <strong>illegal harvesting</strong> of <strong>children’s data</strong> was “extremely lucrative” for Google. This case and the <strong>presence of paedophiles</strong> on the platform forced YouTube to <strong>stop the monetization</strong> of these videos and to <strong>block comment access</strong>.</p><p>This shows how the US wants to <strong>protect children</strong> from the persuasive strength of AI-based targeted ads.</p><p>Read <a href="https://www.theguardian.com/technology/2019/sep/04/youtube-kids-fine-personal-data-collection-children-">Google Fined for targetting children</a>— By <em>Associated Press</em></p><h3>8 — The Global Expansion of AI Surveillance</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1000/0*YxzoXW-zF4HXj6uR.jpg" /></figure><p>In 1984, when George Orwells wrote its “Big Brother” dystopia, the fear of the <strong>global surveillance</strong> of the population has been put in our brains.</p><p>This <strong>report</strong>, published by <strong>Carnegie</strong>, studies the <strong>evolution of AI Surveillance</strong> over 176 countries with questions such as <strong>which and how governments are using it</strong>. The number of countries using it is rapidly <strong>increasing</strong>, and according to the report, already 64 countries are using <strong>facial recognition</strong>.</p><p><strong>Carnegie has summed up his report through 8 key-finding.</strong> Read <a href="https://carnegieendowment.org/2019/09/17/global-expansion-of-ai-surveillance-pub-79847">The Global Expansion of AI Surveillance </a>— From <a href="https://carnegieendowment.org">Carnegie</a></p><h3>9 — New Google Multilingual Speech recognition</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*h1sH8EIccjKmT_d-.jpeg" /></figure><p>The <strong>vocal assistant</strong> market is still growing rapidly, some <strong>challenging points</strong> that this technology is yet to overcome are the <strong>size of the model</strong>, the need for a <strong>model for each language</strong>, and <strong>latency</strong>.</p><p>Google made improvements towards solving these issues with a <strong>new end to end model</strong> which is a single model to understand them all. The experimentation has been done in India, this model can understand <strong>9 different languages</strong> with <strong>low latency</strong> and <strong>fewer parameters</strong> than other states of the art models.</p><p>The learning is done by providing labelled speech. To prevent bias from the unequal distribution of training data by language, the <strong>architecture</strong> has been built to <strong>separate different languages</strong>. <strong>It outperforms state of the art monolingual models</strong>.</p><p>Read <a href="http://ai.googleblog.com/2019/09/large-scale-multilingual-speech.html">New Google Multilingual Speech recognition</a> — From <a href="https://ai.googleblog.com">Google AI blog</a></p><h3><strong>10 — Artificial Intelligence Can’t Think Without Polluting</strong></h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*GpgLb5-_aXkF0aX_.jpeg" /></figure><p>As artificial intelligence growth, the amount of <strong>computation</strong> used to train models is also <strong>continuously getting bigger and bigger</strong>. Especially in leading companies which use a <strong>lot of energy</strong> to create their state of the art models.</p><p>Even if for now, AI does not seems to weight much of the <strong>global energy balance</strong>, we may start to think about <strong>its impact</strong>. Multiple solutions exist, like starting to <strong>measure power consumption</strong> and to <strong>reward efficiency</strong> instead of accuracy.</p><p>This article is linked to the 3rd article (How to make technology work for society), it may become more and more important to <strong>focus on impactful models</strong> and stop using our energy and our time on &quot;<em>so-so Technologies</em>&quot;.</p><p>Read <a href="https://slate.com/technology/2019/09/artificial-intelligence-climate-change-carbon-emissions-roy-schwartz.html">Artificial Intelligence Can’t Think Without Polluting</a>— By <a href="https://slate.com/author/april-glaser">April Glaser</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=45a1b457cb5d" width="1" height="1" alt=""><hr><p><a href="https://medium.com/sicara/best-ai-september-2019-45a1b457cb5d">The Best of AI: New Articles Published This Month (September 2019)</a> was originally published in <a href="https://medium.com/sicara">Sicara&#39;s blog</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
    </channel>
</rss>