Operations with petl: Advanced Extractions & Transformations

Petl 1.6    |    Intermediate
  • 10 videos | 1h 15m 43s
  • Includes Assessment
  • Earns a Badge
Rating 4.1 of 7 users Rating 4.1 of 7 users (7)
Petl facilitates and streamlines tasks related to data extraction and manipulation, often required by software developers to make data fit for actionable business intelligence (BI). In this course, you'll work with complex operations in petl and outline how to extract data from a source and convert it to a format that complies with your requirements. You'll begin by investigating the use of regular expressions to analyze, search, and extract specific rows and columns in a petl table. You'll then create transform functions and apply them to your data. These include operations on numeric as well as string fields. Moving on, you'll implement sort operations to organize data in a petl table and arrange it in a sequence that suits your purposes. Finally, you'll investigate how to perform joins and set operations on data tables and meaningfully reduce the data in them using aggregation functions.

WHAT YOU WILL LEARN

  • Discover the key concepts covered in this course
    Access regular expression patterns in data tables
    Implement regular expression searches on petl data tables
    Implement split operations on data stored within petl data tables
    Unpack nested fields when creating data tables
  • Use mapping to perform various operations on columns and fields in data tables
    Transform data by rows using rowmap() and rowmapmany() functions
    Perform sort operations on data stored within a data table
    Implement sql-like joins on data from multiple petl tables
    Summarize the key concepts covered in this course

IN THIS COURSE

  • 2m 50s
    In this video, you’ll learn more about your instructor and this course. In this course, you’ll learn more about the complex operations in PETL when it comes to extracting data from a source and converting it to a format that complies with your requirements. You’ll start by delving into the use of regular expressions to analyze, search for, and extract specific rows and columns in your petl table, based on the patterns you define. FREE ACCESS
  • 10m 6s
    In this video, you’ll learn more about ETL operations. You’ll learn that working with unstructured data and regular expressions can be a powerful addition to your toolkit. You’ll learn that operations like searching for a pattern, performing search and replace operations, getting a sense of the formats of values in a field are all important in an ETL context. You’ll learn how petl supports these. FREE ACCESS
  • Locked
    3.  Performing Searches in petl Using Regular Expression
    4m 44s
    In this video, you’ll watch a demo. In this demo, you’ll start where you left off at the end of the last one. You’ll find the rows that contain a pattern. You’ll return to etl.search, with a different variant. Here, you’ll search for a pattern in a column. You’ll pass in not only the petl table, but also the name of the column, which is 'nameSuffix'. Next, you’ll examine the return values. FREE ACCESS
  • Locked
    4.  Splitting Fields in petl Using Regular Expressions
    9m 29s
    In this video, you’ll watch a demo. In this demo, you’ll learn operations where you’ll modify data based on patterns present within it. You’ll learn that search and replace is closely related to etl.search. Next, you’ll move to the etl.sub function, which is a search and replace operation. Onscreen, you’ll see etl.sub. We've passed in our petl table called basketball_data. Then, you’ll see there’s a column name. Next, you’ll see the search and replace parameters. FREE ACCESS
  • Locked
    5.  Unpacking Nested Fields in petl
    5m 14s
    In this video, you’ll watch a demo. In this demo, you’ll work with some specialized ETL functions. Specifically, you’ll learn to unpack a nested field. Onscreen, you’ll see a dataset. This is a list of lists called names_of_players. You’ll notice that this dataset’s second column is in the form of a nested field. Each individual player's name is mentioned as an array, and that array consists of the first name, middle name and last name. FREE ACCESS
  • Locked
    6.  Converting Fields in petl Data Tables Using Mapping
    9m 23s
    In this video, you’ll watch a demo. In this demo, you’ll continue with the theme of transforming data using petl. You’ll learn about radical transformations that perform complete makeovers of the data. The output will look very different from the input. Onscreen you’ll see the required import statements. These include two familiar ones: pandas as pd and petl as etl and a new one. From collections, you’ll import OrderedDict. This is a special Python dictionary. FREE ACCESS
  • Locked
    7.  Transforming Data by Rows in petl
    10m 51s
    In this video, you’ll watch a demo. In this demo, you’ll focus on row-oriented transformations rather than column-oriented transformations. You’ll continue using the same dataset you used in the previous demo. Onscreen, you’ll see a function called rowmapper. It will be applied to the input data. The rowmapper function will operate on the input data, one row at a time. Now, you’ll learn what the rowmapper function does. FREE ACCESS
  • Locked
    8.  Sorting Data in petl Data Table
    10m 7s
    In this video, you’ll watch a demo. In this demo, you’ll learn about sorting operations. You’ll use the same dataset from the previous demo. This dataset is in a petl table read in from a pandas DataFrame. You’ll start with a simple sorting operation, etl.sort. It takes a petl table in and performs a sort across the columns in the input table. You don’t need to specify a subset of the columns to sort. FREE ACCESS
  • Locked
    9.  Joining petl Data Tables
    10m 58s
    In this video, you’ll watch a demo. In this demo, we will turn our attention to joins. You’ll need the import of petl as etl. Onscreen now is a list of lists, called trees_legal_status. It consists of data with three columns: 'Number', 'TreeID' and 'LegalStatus'. You’ll add a second list of lists. This one is called street_trees, and this one has a slightly different set of columns: 'Number', 'TreeID', 'Address', 'SiteOrder', and 'Caretaker'. FREE ACCESS
  • Locked
    10.  Course Summary
    2m 3s
    In this video, you’ll summarize what you learned in the course. In this course, you learned the use of regular expressions to analyze, search for, and extract specific rows and columns in your petl table based on the patterns you defined. You then learned to transform and sort data. You also learned to implement sort operations to order data in a format you need. Finally, you implemented join operations in petl. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Rating 4.6 of 13 users Rating 4.6 of 13 users (13)
Rating 4.6 of 48 users Rating 4.6 of 48 users (48)
Rating 4.7 of 12 users Rating 4.7 of 12 users (12)

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Rating 3.8 of 5 users Rating 3.8 of 5 users (5)
Rating 4.4 of 34 users Rating 4.4 of 34 users (34)
Rating 4.8 of 37 users Rating 4.8 of 37 users (37)