May 21, 2019 – Danvers, Mass. – Copyright Clearance Center, Inc. (CCC), a leader in advancing copyright, accelerating knowledge, and powering innovation, is sponsoring and hosting the next monthly Boston Apache Spark User Group Meetup at 6:30 pm on 30 May 2019 at its headquarters at 222 Rosewood Drive in Danvers.

The Boston Apache Spark User Group is open to technology professionals in the Greater Boston area interested in Apache Spark – what it is, what it does, and what other people are doing with it. Recent Apache Spark meetups were sponsored by WB Games Boston, QuantumBlack, and McGraw Hill.

The 30 May meetup will feature presentations by CCC’s Matt Kleiderman, Director of Architecture, and Glenn Street, Data Architect. Kleiderman will speak on “Author Disambiguation in a Knowledge Graph,” covering how to build a knowledge graph of authorships and citations. Street’s presentation, “How to (Not) Light a Pile of Money on Fire Using On-Demand Web Services for ETL,” will describe how Street and his colleagues learned to make the most of on-demand serverless ETL and will offer tips for keeping costs under control.

Nick Afshartous, Principal Big Data Engineer at WB Games, and Helen Liu, a computer science major at Northeastern University currently on co-op with WB Games, will also present an approach to auto-scaling Amazon EMR clusters running Spark Streaming.

About CCC

A pioneer in voluntary collective licensing, CCC helps organizations integrate, access, and share information through licensing, content, software, and professional services. With expertise in copyright, information management, artificial intelligence, and machine learning, CCC and its subsidiary RightsDirect collaborate with stakeholders to design and deliver innovative information solutions that power decision-making by harnessing information from a wide variety of data sources and content assets.