Edit

Share via


Analyze data in Azure Data Lake Storage Gen2 by using Power BI

In this article, you learn how to use Power BI Desktop to analyze and visualize data stored in a storage account that has a hierarchical namespace (Azure Data Lake Storage Gen2).

Prerequisites

Before you begin this tutorial, you must have the following prerequisites:

  • An Azure subscription. Go to Get Azure free trial.
  • A storage account that has a hierarchical namespace. Follow the instructions at Create a storage account to create one. This article assumes that you've created a storage account named contosoadlscdm.
  • Ensure you are granted one of the following roles for the storage account: Blob Data Reader, Blob Data Contributor, or Blob Data Owner.
  • A sample data file named Drivers.txt located in your storage account. You can download this sample from Azure Data Lake Git Repository, and then upload that file to your storage account.
  • Power BI Desktop. You can download this application from the Microsoft Download Center.

Create a report in Power BI Desktop

  1. Launch Power BI Desktop on your computer.

  2. To connect to the sample data, follow the instructions in the Azure Data Lake Storage Gen2 connector article.

  3. From the Power Query Navigator, select Load.

    Screenshot of the Navigator, with data from the Drivers.txt file displayed.

  4. After the data is successfully loaded into Power BI, the following fields are displayed in the Fields panel.

    Screenshot of the Fields panel, showing Query1, which contains Date accessed, Date created, Date modified, Extension, Folder Path, and Name fields.

    However, to visualize and analyze the data, you might prefer the data to be available using the following fields.

    Screenshot of a data table, with names and addresses from the Content column in the original table displayed in rows.

    In the next steps, you update the query to convert the imported data to the desired format.

  5. From the Home tab on the ribbon, select Transform Data. The Power Query editor then opens, displaying the contents of the file.

    Screenshot of the Power Query editor, with the query created from the original Drivers.txt file.

  6. In the Power Query editor, under the Content column, select Binary. The file is automatically detected as CSV and contains the following output. Your data is now available in a format you can use to create visualizations.

    Screenshot of the Power Query editor, showing the data from the Binary field expanded to a table with names and addresses, with Imported CSV and Changed Type added to the Applied Steps panel.

  7. From the Home tab on the ribbon, select Close & Apply.

    Screenshot of the Power Query editor, showing the Close & Apply option.

  8. Once the query is updated, the Fields tab displays the new fields available for visualization.

    Screenshot of the Fields pane in Power BI, with Query1 now displaying Column1 through Column8.

  9. Now you can create a pie chart to represent the drivers in each city for a given country/region. To do so, make the following selections.

    From the Visualizations tab, select the symbol for a pie chart.

    Screenshot of the Visualizations pane, with the pie chart symbol emphasized.

    In this example, the columns you're going to use are Column 4 (name of the city) and Column 7 (name of the country/region). Drag these columns from the Fields tab to the Visualizations tab.

    Screenshot with Column4 dragged and dropped in Details. Column7 is dragged and dropped in Values in the Visualization pane. Column7 is also dragged and dropped into Filters on this page in the Filters pane.

    The pie chart should now resemble the one in the following image.

    Screenshot of pie chart showing the percentages of each slice of the pie, along with color coding of each of the locations, and a column containing all locations and their color code on the right side.

  10. If you select a specific country/region from the page level filters, the number of drivers in each city of the selected country/region is displayed. For example, under the Visualizations tab, under Page level filters, select Brazil.

    Screenshot of the Filters on this page pane, with Brazil selected as the country/region to display.

  11. The pie chart is automatically updated to display the drivers in the cities of Brazil.

    Screenshot of pie chart with the percentages of each slice of the pie for locations in Brazil, along with a list of all locations in Brazil and their color code in a column on the right side.

  12. From the File menu, select Save to save the visualization as a Power BI Desktop file.

Publish report to Power BI service

After you create the visualizations in Power BI Desktop, you can share it with others by publishing it to the Power BI service. For instructions on how to do that, go to Publish from Power BI Desktop.