I'm off to bed now, but in case anyone has thoughts about this I'd be all ears:
Any ideas on how to do very simple human voice detection? I just want to tell whether an audio stream is likely to be a voice or not, to improve on the accuracy of the simple volume-based approach that most chat apps use.
The best I've come up with is finding the largest frequency bin and checking whether it lies in a normal vocal range (100 Hz to 8 kHz), but that seems like it'd also give lots of false positives.
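
For reference, the peak-bin idea could be sketched roughly like this in Python with NumPy. The function names, the Hann window, and the 2048-sample frame at 44.1 kHz are all just assumptions for illustration; only the 100 Hz to 8 kHz range comes from the idea above.

```python
import numpy as np

def dominant_frequency(frame, sample_rate):
    """Return the frequency (Hz) of the largest-magnitude FFT bin in one frame."""
    # Hann window (an assumed choice) to reduce spectral leakage at the frame edges.
    windowed = frame * np.hanning(len(frame))
    spectrum = np.abs(np.fft.rfft(windowed))
    spectrum[0] = 0.0  # ignore any DC offset
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    return float(freqs[np.argmax(spectrum)])

def looks_like_voice(frame, sample_rate, low_hz=100.0, high_hz=8000.0):
    """Crude check: is the loudest frequency bin inside the assumed vocal range?"""
    peak = dominant_frequency(frame, sample_rate)
    return bool(low_hz <= peak <= high_hz)

def sine(freq_hz, sample_rate=44100, n_samples=2048):
    """Hypothetical helper that generates a pure test tone."""
    t = np.arange(n_samples) / sample_rate
    return np.sin(2 * np.pi * freq_hz * t)

if __name__ == "__main__":
    for f in (50, 440, 1000, 12000):
        print(f, looks_like_voice(sine(f), 44100))
```

A pure 1 kHz tone passes this check just as well as speech would, which is exactly the false-positive worry: anything whose loudest component sits in that band gets through.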