22,414 questions
-1
votes
0
answers
76
views
What is the character code of the debugger on Visual Studio?
What is the character code of those characters appear when you place mouse-pointer on a variable?
I'm trying to read in a UTF-8 written file. (Written in Japanese.)I've read in a Shift-JIS file and ...
0
votes
0
answers
42
views
Japanese Character set via ODBC to Access and Excel
I've hit an issue with UTF-8 characters and having had a trawl around here and the web I can find similar issues but everything I have tried doesn't work. For the record I'm self-taught so feel free ...
1
vote
2
answers
56
views
Angular unit tests failed because of special character (UTF8 issue ?)
I'm using angular 19 since couple of months and i test my app with karma in a bash temrinal.
I don't know why and when but since some days, my unit tests failed on special characters not recognized by ...
0
votes
1
answer
55
views
Translate UTF-8 punctuation with normal ascii punctuation marks
I'm trying to cleanup a raw data that has embedded \r\n or \n in csv lines.Line terminator is \r\n.
trying to translate utf-8 punctuation marks to normal ascii punctuation marks.
cleaning up any ...
0
votes
1
answer
89
views
PyPy3 on Windows 11 doesn't display non-ASCII characters correctly
PyPy3 doesn't display non-ASCII UNICODE characters correctly.
A simple example, the following:
b'\xce\x9e\xce\xad\xce\xbd\xce\xb7 \xce\x93\xce\xae\xce\xb9\xce\xbd\xce\xbf\xcf\x82'.decode('utf8')
...
1
vote
1
answer
83
views
Why does Encoding.UTF8.GetMaxByteCount(1) return 6?
The TLDR here is simple: What's a sequence of chars that would make either UTF8's Encoding or Encoder return 6 (or even 5) bytes for a single char, as GetMaxByteCount implies it might?
The non-TLDR:
...
0
votes
3
answers
299
views
std::basic_string<unsigned char> is undefined in Xcode 16.3
I just upgraded from Xcode 16.2 to Xcode 16.3, compiling with C++11.
I have been using this definition for years:
typedef std::basic_string<unsigned char> ustring;
But now, apparently, this ...
4
votes
1
answer
71
views
How can I quickly split an utf8 string into chars in OCaml?
I'm handling a string with special characters and I want to split it into Unicode characters, but apparently OCaml strings are encoded in utf-8 while OCaml chars are 1-byte values, and it appears that ...
1
vote
1
answer
69
views
How to correctly read and convert an ANSI encoded file to String in Dart? [closed]
I'm trying to read a text file saved with ANSI encoding on my system (Windows) using Dart. The content appears with strange characters (accents, cedillas, etc.). How can I read this file correctly and ...
0
votes
1
answer
214
views
How to efficiently calculate the fraction (valid UTF8 byte sequence of length N)/(total N-byte sequences)? [closed]
This will be a long post. And it absolutely has nothing to do with homework, I am just curious, and this won't have immediate practical benefits, but that is like pursuing pure science, you never know ...
0
votes
1
answer
79
views
Java throws illegal character: '\ufeff' when connecting to Microsoft SQL Server using JDBC [duplicate]
I'm trying to create a simple Java application to connect to a Microsoft SQL Server database for a login interface.
I’ve installed the following:
JDK 24
MySQL Connector/J 9.2.0
Here's the code I'm ...
-2
votes
2
answers
62
views
Does checking a prefix string by comparing to a byte slice fail?
I'm learning the book "Go Programing Language", when it introduce string, it says Go use utf-8 encoding system, so it's easy to check whether a string is a prefix/suffix of another base ...
-3
votes
1
answer
81
views
What characters or encoding is replacing spaces in datetime format?
I am trying different cultures in spanish language and testing some latin and south american cultures.
When using "es-MX" culture the console shows me correctly the date, but when using &...
0
votes
1
answer
55
views
Azure functions decoding utf8 strings in Python
I'm running this code on my Python code with version 3.10.6:
def downloadRevistaXML():
url= f"https://revistas.inpi.gov.br/txt/RM2823.zip"
try:
response = requests.get(url)
...
0
votes
1
answer
66
views
Can I specify the replacement character in str([value], encoding='utf-8', errors='replace')
Is it possible to specify the replacement character used by str(xxx,encoding='utf-8', errors='replace') to be something other than the diamond-question-mark character (�)?
I am attempting to fix up a ...