Hi there,
The requirement is quite clear and straightforward to implement. The only questions where would you prefer this to be run, on a server where you can access with browser or on your Windows desktop? Based on your decision I'll either use python or c# respectively since. We just need to preprocess the recordings to convert them to the format google speech requires.
Anyway, this'll be ready in 2 days at most, thanks.