How do I go about training a dance classifier with video files? Are there any audio or video components I can use to get started?

You can definitely build a dance classifier using the action classifier in Create ML. See the following session from WWDC 2020:

Build an Action Classifier with Create ML
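
Before training, the clips need to be organized so each one carries a label. One layout the Create ML action classifier accepts is a folder per class, with that class's videos inside it. Here is a minimal sketch of a helper that builds that layout. It assumes, purely for illustration, that your raw clips are named `<label>_<anything>.<ext>` (for example `salsa_01.mov`); the function name, the naming scheme, and the extension list are all hypothetical, so adjust them to your own data.

```python
import shutil
from pathlib import Path

# Assumed set of video extensions; adjust to match your own clips.
VIDEO_EXTENSIONS = {".mov", ".mp4", ".m4v"}


def sort_videos_by_label(source_dir, dataset_dir):
    """Copy clips named '<label>_<anything>.<ext>' into dataset_dir/<label>/.

    Returns a dict mapping each label to the number of clips copied.
    """
    counts = {}
    for video in sorted(Path(source_dir).iterdir()):
        # Ignore subfolders and anything that is not a video file.
        if not video.is_file() or video.suffix.lower() not in VIDEO_EXTENSIONS:
            continue
        # The label is the part of the file name before the first underscore.
        label, separator, _ = video.stem.partition("_")
        if not separator or not label:
            continue  # file does not follow the assumed naming scheme
        target = Path(dataset_dir) / label
        target.mkdir(parents=True, exist_ok=True)
        shutil.copy2(video, target / video.name)
        counts[label] = counts.get(label, 0) + 1
    return counts
```

Calling `sort_videos_by_label("raw_clips", "DanceDataset")` would leave you with folders such as `DanceDataset/salsa/` and `DanceDataset/tango/`, which you can then select as the training data for the action classifier.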

