pdf.js get info about embedded fonts -
i using pdf.js. fetching text blocks font info
object { str: "blabla", dir: "ltr", width: 191.433141, height: 12.546, transform: array[6], fontname: "g_d0_f2" }
is possible somehow more information g_d0_f2.
notice pdf.js gettextcontent not , not suppose match glyphs in pdfs. pdf32000 specification has 2 different algorithms text display , extraction. if can lookup font data in page.commonobjs, might not helpful extracted text content display due glyphs encoding mismatch.
the page's gettextcontent doing text extraction , getoperatorlist gets (glyph) display operators. see how src/display/svg.js renderer displays glyphs.
Comments
Post a Comment