<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:cc="http://cyber.law.harvard.edu/rss/creativeCommonsRssModule.html">
    <channel>
        <title><![CDATA[Stories by Volodymyr Miz on Medium]]></title>
        <description><![CDATA[Stories by Volodymyr Miz on Medium]]></description>
        <link>https://medium.com/@mizvladimir?source=rss-3502d10e47ee------2</link>
        <image>
            <url>https://cdn-images-1.medium.com/fit/c/150/150/1*npi3ybhaFRNWi49uirsAwQ.jpeg</url>
            <title>Stories by Volodymyr Miz on Medium</title>
            <link>https://medium.com/@mizvladimir?source=rss-3502d10e47ee------2</link>
        </image>
        <generator>Medium</generator>
        <lastBuildDate>Thu, 28 May 2026 03:13:43 GMT</lastBuildDate>
        <atom:link href="https://medium.com/@mizvladimir/feed" rel="self" type="application/rss+xml"/>
        <webMaster><![CDATA[yourfriends@medium.com]]></webMaster>
        <atom:link href="http://medium.superfeedr.com" rel="hub"/>
        <item>
            <title><![CDATA[How to integrate Apache Spark, Intellij Idea and Scala]]></title>
            <link>https://medium.com/@mizvladimir/how-to-integrate-apache-spark-intellij-idea-and-scala-bf5edef8c4c?source=rss-3502d10e47ee------2</link>
            <guid isPermaLink="false">https://medium.com/p/bf5edef8c4c</guid>
            <category><![CDATA[scala]]></category>
            <category><![CDATA[scala-programming]]></category>
            <category><![CDATA[intellij-idea]]></category>
            <category><![CDATA[apache-spark]]></category>
            <category><![CDATA[spark]]></category>
            <dc:creator><![CDATA[Volodymyr Miz]]></dc:creator>
            <pubDate>Tue, 30 Aug 2016 00:00:00 GMT</pubDate>
            <atom:updated>2017-04-21T10:00:09.273Z</atom:updated>
            <content:encoded><![CDATA[<p><a href="http://jupyter.org/">Jupyter Notebook</a> is really useful when you want to present some code, let someone reproduce your results or just learn how to use new tools and libraries. I use Jupyter almost every day and, as many others, I started learning Spark and developed my first data analysis pipelines using interactive notebooks and Python API. Then I realized that I want more and running notebooks locally is not enough for me, so I signed up for <a href="https://databricks.com/try-databricks">Databricks Community Ediditon</a> subscription. Databricks allows to forget about the problems related to setting up and maintaining the environment.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/894/0*1U_0Mf9z16sx8f-8.png" /></figure><p>Everyone who is learning and using Spark eventually realize that Python API is not as powerful and flexible as the core language of the framework — Scala. This language allows to start feeling the full power of Spark comprising Analytics, Streaming and Graph processing tools. However, Spark is just yet another framework for large scale data analytics. Yes, it is convenient and powerful, but it has a limited number of algorithms and sometimes you need to implement your own custom algorithm. And that is the moment when you need an IDE.</p><p>You can find an example project in my <a href="https://github.com/mizvol/SparkTest">Git repository</a>. It allows you to get started with Spark Scala development in Intellij Idea. Alternatively, you can follow step-by-step instructions on my <a href="http://blog.miz.space/tutorial/2016/08/30/how-to-integrate-spark-intellij-idea-and-scala-install-setup-ubuntu-windows-mac/">blog</a> and create this project from scratch by yourself.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=bf5edef8c4c" width="1" height="1" alt="">]]></content:encoded>
        </item>
    </channel>
</rss>