With CMAP installed, the output file remained containing Chinese characters as raw cid code (CID:xxx). While debuging, I found that PDFPageInterpreter.fontmap['F3'].cid2unicode a dictionary of 200+ ...