I think changing the initial auto-pronunciation to {word} の {kanji}, e.g. 「べんり」の「り」, would be much more helpful than just reading out the word itself.
Reason being:
When words with multiple kanji come up, you can't tell from the initial audio which part is being taught / asked for.
I think this helps cause false associations - e.g. having to write 便 or 利 after hearing べんり makes the two kanji's readings harder to keep separate and therefore easier to confuse. It also negates many of the advantages of the audio when you always need to look at the