yob.id.au: Thinking Sphinx And Unicode
Actually, the useful thing here for me isn’t Sphinx, but character folding, i.e., the lossy conversion of Unicode codepoints to ASCII near-equivalents, which is something that needs to be done for at work when supplying contacts to the Afilias EPP servers, which only support <postalInfo type="int"/>
with contacts.