Sampling Random Thoughts
Anatoliy Plastinin’s Blog
Getting started with Terraform locally
Terraform
localstack
Docker
A quick demo on how to start playing with Terraform using a local development environment
Jul 10, 2020
Using Spark SQL and Spark Streaming together
Spark
Spark Streaming
Spark SQL
Kafka
Docker
JSON
Tutorial on how to get started with Spark SQL, Spark Streaming and Kafka using Docker
Oct 15, 2017
spark-shell without Spark
Spark
Scala
A small trick for playing with Spark APIs without having a Spark distribution installed
Nov 10, 2016
When Data Driven App Smells Bad
anti-patterns
big data
architecture
What can go wrong with data-driven projects: lessons learned from a failed project.
Apr 29, 2016
Spark SQL and Parquet files
Spark
parquet
Tiny note on how to deal with Parquet files with Spark
Feb 29, 2016
Processing JSON data with Spark SQL
Spark
Spark SQL
JSON
Deep dive into JSON support in Spark SQL
Jan 30, 2016
How to Write Data into Parquet
parquet
An example of how to write data into Apache Parquet format
Dec 2, 2015
Running spark-shell in browser with Apache Mesos and Marathon
Spark
Mesos
Marathon
A small trick for running spark-shell as a web app using Mesos and Marathon.
Nov 13, 2015
Getting started with Spark Streaming using Docker
Spark
Kafka
Docker
Step-by-step guide on how to get started with Spark Streaming and Kafka using a Docker environment
Oct 5, 2015