Improve string normalization, and use that for airport name matching
Until now we only did case folding; we now also apply Unicode decomposition to strip diacritical marks. This reduces the airport string table size by ~5% without compromising quality. It should also help with matching non-ASCII names in IATA boarding passes to their normal spelling.
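As a rough illustration of the normalization described above, here is a minimal Python sketch. It is not the code from this commit: the function name `normalize_name` is hypothetical, and the choice of canonical decomposition (NFD) followed by dropping combining marks is an assumption about what "Unicode decomposition to remove diacritic marks" amounts to.

```python
import unicodedata


def normalize_name(name: str) -> str:
    """Hypothetical helper: case-fold, then strip diacritical marks."""
    # Step 1: case folding, the only normalization done previously.
    folded = name.casefold()
    # Step 2: canonical decomposition (assumed NFD), which splits a
    # precomposed character such as "ü" into "u" plus U+0308.
    decomposed = unicodedata.normalize("NFD", folded)
    # Step 3: drop the combining marks (general category "Mn"),
    # keeping only the base characters.
    return "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
```

With this sketch, `normalize_name("Zürich")` and `normalize_name("ZURICH")` both yield `"zurich"`, so the two spellings collapse into a single string table entry. Letters that have no canonical decomposition, such as "Ł", pass through unchanged and would need separate handling.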
parent
4cb73c44