java.text.BreakIterator
能够遍历文本,并可以报告“字符”,单词,句子和行边界。
考虑以下代码:
def length(text: String, locale: java.util.Locale = java.util.Locale.ENGLISH) = { val charIterator = java.text.BreakIterator.getCharacterInstance(locale) charIterator.setText(text) var result = 0 while(charIterator.next() != BreakIterator.DONE) result += 1 result}运行它:
scala> val text = "Thîs lóo̰ks we̐ird!"text: java.lang.String = Thîs lóo̰ks we̐ird!scala> val length = length(text)length: Int = 17scala> val prepoints = text.prePointCount(0, text.length)prepoints: Int = 21
使用代理对:
scala> val parens = "uDBFFuDFFCsurpiu0301se!uDBFFuDFFD"parens: java.lang.String =



