Skip to main content
-1 votes
0 answers
76 views

What is the character code of the debugger on Visual Studio?

What is the character code of those characters appear when you place mouse-pointer on a variable? I'm trying to read in a UTF-8 written file. (Written in Japanese.)I've read in a Shift-JIS file and ...
taka11110010's user avatar
0 votes
0 answers
42 views

Japanese Character set via ODBC to Access and Excel

I've hit an issue with UTF-8 characters and having had a trawl around here and the web I can find similar issues but everything I have tried doesn't work. For the record I'm self-taught so feel free ...
N G's user avatar
  • 21
1 vote
2 answers
56 views

Angular unit tests failed because of special character (UTF8 issue ?)

I'm using angular 19 since couple of months and i test my app with karma in a bash temrinal. I don't know why and when but since some days, my unit tests failed on special characters not recognized by ...
matt2mi's user avatar
  • 164
0 votes
1 answer
55 views

Translate UTF-8 punctuation with normal ascii punctuation marks

I'm trying to cleanup a raw data that has embedded \r\n or \n in csv lines.Line terminator is \r\n. trying to translate utf-8 punctuation marks to normal ascii punctuation marks. cleaning up any ...
ramesh.metta's user avatar
0 votes
1 answer
89 views

PyPy3 on Windows 11 doesn't display non-ASCII characters correctly

PyPy3 doesn't display non-ASCII UNICODE characters correctly. A simple example, the following: b'\xce\x9e\xce\xad\xce\xbd\xce\xb7 \xce\x93\xce\xae\xce\xb9\xce\xbd\xce\xbf\xcf\x82'.decode('utf8') ...
Ξένη Γήινος's user avatar
1 vote
1 answer
83 views

Why does Encoding.UTF8.GetMaxByteCount(1) return 6?

The TLDR here is simple: What's a sequence of chars that would make either UTF8's Encoding or Encoder return 6 (or even 5) bytes for a single char, as GetMaxByteCount implies it might? The non-TLDR: ...
David Wohlferd's user avatar
0 votes
3 answers
299 views

std::basic_string<unsigned char> is undefined in Xcode 16.3

I just upgraded from Xcode 16.2 to Xcode 16.3, compiling with C++11. I have been using this definition for years: typedef std::basic_string<unsigned char> ustring; But now, apparently, this ...
David M. Cotter's user avatar
4 votes
1 answer
71 views

How can I quickly split an utf8 string into chars in OCaml?

I'm handling a string with special characters and I want to split it into Unicode characters, but apparently OCaml strings are encoded in utf-8 while OCaml chars are 1-byte values, and it appears that ...
user8203231's user avatar
1 vote
1 answer
69 views

How to correctly read and convert an ANSI encoded file to String in Dart? [closed]

I'm trying to read a text file saved with ANSI encoding on my system (Windows) using Dart. The content appears with strange characters (accents, cedillas, etc.). How can I read this file correctly and ...
Vinícius Bruno's user avatar
0 votes
1 answer
214 views

How to efficiently calculate the fraction (valid UTF8 byte sequence of length N)/(total N-byte sequences)? [closed]

This will be a long post. And it absolutely has nothing to do with homework, I am just curious, and this won't have immediate practical benefits, but that is like pursuing pure science, you never know ...
Ξένη Γήινος's user avatar
0 votes
1 answer
79 views

Java throws illegal character: '\ufeff' when connecting to Microsoft SQL Server using JDBC [duplicate]

I'm trying to create a simple Java application to connect to a Microsoft SQL Server database for a login interface. I’ve installed the following: JDK 24 MySQL Connector/J 9.2.0 Here's the code I'm ...
Mohamed ghazy's user avatar
-2 votes
2 answers
62 views

Does checking a prefix string by comparing to a byte slice fail?

I'm learning the book "Go Programing Language", when it introduce string, it says Go use utf-8 encoding system, so it's easy to check whether a string is a prefix/suffix of another base ...
hankeyyh's user avatar
  • 139
-3 votes
1 answer
81 views

What characters or encoding is replacing spaces in datetime format?

I am trying different cultures in spanish language and testing some latin and south american cultures. When using "es-MX" culture the console shows me correctly the date, but when using &...
iEligio's user avatar
0 votes
1 answer
55 views

Azure functions decoding utf8 strings in Python

I'm running this code on my Python code with version 3.10.6: def downloadRevistaXML(): url= f"https://revistas.inpi.gov.br/txt/RM2823.zip" try: response = requests.get(url) ...
SFG's user avatar
  • 1
0 votes
1 answer
66 views

Can I specify the replacement character in str([value], encoding='utf-8', errors='replace')

Is it possible to specify the replacement character used by str(xxx,encoding='utf-8', errors='replace') to be something other than the diamond-question-mark character (�)? I am attempting to fix up a ...
Jim JR Harris's user avatar

15 30 50 per page
1
2 3 4 5
1495