HBase Essential Training

Welcome from HBase Essential Training by Ben Sullins …is now available on LinkedIn Learning: https://www.linkedin.com/learning/hbase-essential-training and on Lynda.com: https://www.lynda.com/HBase-tutorials/HBase-Essential-Training/609017-2.html Apache HBase is the Hadoop database—a NoSQL database management system that runs on top of HDFS (Hadoop Distributed File System). Like Hadoop, HBase is an open-source, distributed, versioned, column-oriented store. Companies such as Facebook, Adobe, and […]

Python for Data Science Tips, Tricks, & Techniques

Welcome from Python for Data Science Tips, Tricks, & Techniques by Ben Sullins Modern work in data science requires skilled professionals versed in analysis workflows and using powerful tools. Python can play an integral role in nearly every aspect of working with data—from ingest, to querying, to extracting and visualizing. This course highlights twelve tips and tricks […]

Hadoop for Data Science Tips, Tricks, & Techniques

Welcome from Hadoop for Data Science Tips, Tricks, & Techniques by Ben Sullins Hadoop—the hugely popular big data platform—offers a vast array of capabilities designed to help data scientists deliver their insights. In this course, Ben Sullins helps you get up to speed with Hadoop by sharing a series of tips and tricks for doing data […]

Presto Essentials: Data Science

Netflix and Airbnb both use Presto—an open-source SQL query engine developed by Facebook—for their ever-expanding big data querying needs. In this course, learn how to harness the power of your big data system using the Presto platform, which breaks the false dilemma of having to choose between an expensive commercial solution that offers fast analytics, […]

Looker: First Look

Looker—a powerful data analytics platform—can help both large and small companies glean value from their data. In this short course, get up to speed with Looker, and learn how to leverage this platform to make collecting, visualizing, and analyzing data a bit easier. Ben Sullins begins by explaining how and why Looker is used, and […]

Kafka Essential Training

released 2017-05-01 Developed at LinkedIn, Apache Kafka is a distributed streaming platform that provides scalable, high-throughput messaging systems in place of traditional messaging systems like JMS. In this course, examine all the core concepts of Kafka. Ben Sullins kicks off the course by making the case for Kafka, and explaining who’s using this efficient platform […]

Integrating Tableau and R for Data Science

R is known as one of the most robust statistical computing solutions out there. Tableau—a leading business intelligence platform—provides excellent data visualization and exploration capabilities. When combined, Tableau and R offer one of the most powerful and complete data analytics solutions in the industry today, providing businesses with unparalleled abilities to see and understand their […]

Apache Spark Essential Training

Apache Spark is a powerful platform that provides users with new ways to store and make use of big data. In this course, get up to speed with Spark, and discover how to leverage this popular processing engine to deliver effective and comprehensive insights into your data. Instructor Ben Sullins provides an overview of the […]

Data Engineering Essential Training for Data Science

53m General Released: January 23, 2017 Approach big data with confidence by mastering the core skills needed to put data to work for your business. This course covers the basics of data engineering, system design, analytics, and business intelligence. Data science expert Ben Sullins explains how to collect and organize your data so you can […]

Analyzing Big Data with Hive

1h 53m General Released: January 20, 2017 From the early days of Big Data, it has been a challenge to find ways that allow many different types of people and professions to work with the data, that was until Facebook invented Hive, which is a sequel language that actually processes and analyzes data in Hadoop. […]

1 2 3