An article from jgp: Chapter 9 still covers Spark ingestion (like chapter 7 and chapter 8), but this time, it’s about “anything can become a Spark datasource.” When I was […]
Ingestion of data from databases into Apache Spark
An article from jgp: Chapter 8 of Spark with Java is out and it covers ingestion, as did chapter 7. However, as chapter 7 was focusing on ingestion from files, […]
File Ingestion in Apache Spark
An article from jgp: In a typical Big Data analytics scenario, you will probably be tempted to ingest files. You know, those pesky CSV files where the comma is sometimes […]
Apache Spark with Java
An article from jgp: Apache Spark has been a game changer for distributed data processing, thanks to an easy to understand API, a focus on simplicity, and an adoption of […]
Apache Spark Maturity on the Rise
An article from jgp: Spark Summit Europe 2017 just concluded, here, in Dublin. More than 102 speakers, 1200 attendees, and an impressive Databricks team attended the 3-day long celebration. Spark […]
Spark is Making Big Data Easy at NCDevCon
An article from jgp: NCDevCon is a yearly event in the Triangle, targeted for developers of all breeds, from front-end to back-end. Its origin starts in the ol’ days of […]
Loading CSV in Spark
An article from jgp: Loading CSV in Apache Spark is a standard feature since version 2.0, previously you required a plugin (provided by Databricks). Although it starts with a basic […]
A New Dimension for Apache Spark Clusters
An article from jgp: Summer has been busy and it’s now behind us. I won’t annoy you with all the details of what happened but I wanted to come back […]
A Deep-Dive Introduction to Spark for RDBMS Users
An article from jgp: Earlier in the summer, I start a series of articles for IBM developerWorks. Those articles focus on Apache Spark from a RDBMS user perspective, of course, […]
Getting Ready for This Pint of Guinness
An article from jgp: Next month, I’ll be heading to Dublin, the capital of Ireland. I have been to Ireland quite a few times – I was 3 the first […]