
Re: [HTCondor-users] unicode with strcat



You need to get the UTF-8 encoding onto your command line (for condor_hold -reason) or into your configuration file (for WANT_HOLD_REASON). That will depend on your terminal and text editor.
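As a rough illustration of what "getting the UTF-8 encoding onto the command line" means, here is a minimal Python sketch. It only builds an argument list and prints the bytes; it does not run anything. The reason text and the job ID are hypothetical placeholders, and whether your shell or editor produces these same bytes depends on its locale and encoding settings.

```python
# Sketch only: builds the argument list, does not execute condor_hold.
STOP_SIGN = "\U0001F6D1"  # U+1F6D1, the stop sign emoji


def utf8_hex(text):
    """Return the UTF-8 bytes of text as space-separated hex pairs."""
    return " ".join(f"{b:02x}" for b in text.encode("utf-8"))


reason = STOP_SIGN + " held by admin"  # hypothetical reason text
job_id = "123.0"                       # hypothetical cluster.proc

# Argument list one could hand to subprocess.run(); with a UTF-8 locale
# the reason string reaches the tool as the UTF-8 bytes shown below.
argv = ["condor_hold", "-reason", reason, job_id]

# Print only the hex form so the output is safe on any terminal.
print(utf8_hex(STOP_SIGN))
```

If your terminal or editor is not set to UTF-8, the bytes that arrive will differ from the four shown by `utf8_hex`, which is the thing to check first.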

 - Jaime

On Oct 12, 2023, at 9:30 AM, Rita <rmorgan466@xxxxxxxxx> wrote:

For my hold reason, I would like to have a stop sign emoji -- U+1F6D1.  How can I do that?

On Thu, Oct 12, 2023 at 10:22 AM Jaime Frey via HTCondor-users <htcondor-users@xxxxxxxxxxx> wrote:
> On Oct 11, 2023, at 4:06 PM, Rita <rmorgan466@xxxxxxxxx> wrote:
>
> I would like to put UTF-8 encoding in my strcat command for better messages. Is that possible with HTCondor?


I'm unsure if you're talking about the strcat command-line tool or the strcat() ClassAd function, so I'll address both.

The jobâs stdout and data files can contain any data format you want.

The ClassAd language naively tolerates UTF-8 encoding in string values. It has no understanding of Unicode code points or multi-byte characters. It will preserve the values, but treat each byte as an independent character. Thus, the strcat() and strcmp() functions will do byte-by-byte copying and comparison of string values with UTF-8 encoding, which keeps the encoding intact. The substr() function, however, counts bytes and will happily bisect a multi-byte character.
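To make the byte-versus-character distinction concrete, here is a small Python model of the behavior described above. This is not HTCondor code: byte_strcat and byte_substr are hypothetical helpers that only imitate byte-oriented concatenation and slicing, assuming offsets and lengths are counted in bytes.

```python
def byte_strcat(*parts):
    """Concatenate strings byte by byte, as a byte-oriented strcat would."""
    return b"".join(p.encode("utf-8") for p in parts)


def byte_substr(text, offset, length):
    """Slice by byte offset and byte length, ignoring character boundaries."""
    raw = text.encode("utf-8")
    return raw[offset:offset + length]


msg = "\U0001F6D1 stop"  # 6 code points, 9 bytes in UTF-8

# Concatenation is safe: whole byte sequences are copied unchanged.
joined = byte_strcat("\U0001F6D1", " stop")
assert joined.decode("utf-8") == msg

# Slicing 2 bytes cuts the 4-byte emoji in half: invalid UTF-8.
half = byte_substr(msg, 0, 2)
try:
    half.decode("utf-8")
    bisected = False
except UnicodeDecodeError:
    bisected = True

# Slicing on a character boundary (4 bytes) keeps the emoji whole.
whole = byte_substr(msg, 0, 4)
```

The practical upshot is that a substring taken from a UTF-8 value needs offsets and lengths that land on character boundaries, computed in bytes rather than in characters.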

 - Jaime