Hive QL to Azure Data Factory Data Flow

Here considering a sql select query with 5 joins. i have one fact and 4 dimensions to join

  • Fact Population
  • Dimension State
  • Dimension Countyname
  • Dimension Race
  • Dimension Sex
  • First create a pipeline
  • Create a New data flow
  • connect to fact first as source
  • Select join
  • Configure state as another source
  • Connect the join columns
  • Do the same for other dimensions as below image.
  • Once joins are created then save and publish
  • Then go to pipeline and trigger once

Originally published at https://github.com.

--

--