This really isn’t a Go problem. As @calmh said, first figure out how you can solve the problem in any language.
Processing video and audio is hard, and magically detecting quality is also hard. How do you determine if a video image is good or bad quality? What makes one frame good quality and another bad? How do you distinguish between noise and good video?