How do I encode text to utf-16be "correctly"?

June 24, 2022

I am trying to reproduce the (ABC) example from this site:
https://opensource.adobe.com/dc-acrobat-sdk-docs/acrobatsdk/html2015/index.html#t=Acro12_MasterBook%2Fpdfmark_Basic%2FBookmarks_OUT.htm

For example, the Unicode string for (ABC) is <FEFF004100420043>.

But when I try to reproduce just the ABC, I get:

"ABC".encode(encoding="utf-16be")
Out[29]: b'\x00A\x00B\x00C'

I think I am misunderstanding a larger concept, but I am unsure what to look for.

I need to produce the exact same string, so for the minimal example above I would need: 004100420043. The question therefore is: How do I get from one representation to the other?

Given the existing answer by gog, what remains is:
How do I get from b'\xFE\xFF\x00\x41\x00\x42\x00\x43' to FEFF004100420043?

Solution:

It looks like they want the BOM (byte order mark, FE FF for big-endian UTF-16) as well, so

import codecs
result = codecs.BOM_UTF16_BE + "ABC".encode(encoding="utf-16be")

which would be

b'\xfe\xff\x00A\x00B\x00C'

which is the same as

b'\xFE\xFF\x00\x41\x00\x42\x00\x43'
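If the two literals look different to you, here is a quick check that they really are the same bytes. This is only a sketch: the names `result` and `same` are just for illustration. Python's repr prints any byte that is printable ASCII as the character itself, so 0x41 shows up as A.

```python
import codecs

# BOM for big-endian UTF-16 (bytes FE FF), followed by the encoded text.
result = codecs.BOM_UTF16_BE + "ABC".encode("utf-16be")

# repr() shows printable ASCII bytes as characters, so b'\x00\x41' prints as b'\x00A'.
same = result == b"\xFE\xFF\x00\x41\x00\x42\x00\x43"
print(same)          # True
print(list(result))  # [254, 255, 0, 65, 0, 66, 0, 67]
```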

To convert that to the hex format, use

result.hex()

optionally followed by .upper() to get uppercase digits, i.e. 'FEFF004100420043'.
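Putting it together, a minimal sketch might look like this. The helper name `to_pdf_hex_string` is made up for this example, and wrapping the result in angle brackets is an assumption based on the `<FEFF004100420043>` form quoted in the question.

```python
import codecs


def to_pdf_hex_string(text: str) -> str:
    """Hypothetical helper: UTF-16BE with BOM, as uppercase hex in <...>."""
    raw = codecs.BOM_UTF16_BE + text.encode("utf-16be")
    return "<" + raw.hex().upper() + ">"


print(to_pdf_hex_string("ABC"))                 # <FEFF004100420043>

# Without the BOM, this is the 004100420043 form asked for in the question.
print("ABC".encode("utf-16be").hex().upper())   # 004100420043
```

`bytes.hex()` needs Python 3.5 or later; on older versions `binascii.hexlify(raw)` does the same job but returns bytes.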

encoding

by MR
