Something is wrong with forum text encoding

Conversation au sujet de Encore en français

Moderators: Hotch, Denkster, John Miller

Something is wrong with forum text encoding

Postby polarbreeze » Sat May 05, 2012 11:09 am

I have tried all available text encoding options on the Mac in Safari, Firefox and Internet Explorer (via Parallels) - none of them render the forum text properly, for example the C-cedille in the head of this topic. What I see ends up being something like this:
Screen Shot 2012-05-05 at 7.06.16 AM.png
Screen Shot 2012-05-05 at 7.06.16 AM.png (10.8 KiB) Viewed 2269 times
polarbreeze
 
Posts: 3494
Joined: Fri Sep 14, 2007 6:56 pm

Re: Something is wrong with forum text encoding

Postby Denkster » Sat May 05, 2012 2:26 pm

Hi Polarbreeze,

Probably, an amateur moved the forum to the new platform.
While setting up the database and exporting-importing the database content, neglected to preserve character encoding of the database management system and data in the migration process.
Probably the DBMS renders UTF-8 coding as ISO-8xxx .

Sigh..

Eveline
Denkster
 
Posts: 3357
Joined: Fri Sep 14, 2007 1:28 pm
Location: The Netherlands, 's-Gravenhage

Re: Something is wrong with forum text encoding

Postby polarbreeze » Sat May 05, 2012 3:29 pm

Denkster wrote:... neglected to preserve character encoding of the database management system and data in the migration process...
Oh, that will be hard to fix globally won't it. But at least they should correct the more visible errors, like the forum headings for example...
polarbreeze
 
Posts: 3494
Joined: Fri Sep 14, 2007 6:56 pm

Re: Something is wrong with forum text encoding

Postby q » Sat May 05, 2012 4:53 pm

Looks like this has strewn ugly stuff throughout the forum. Here's but one example, where I think some curly quotes and apostrophes were pasted in.

http://www.gvox.com/forum/viewtopic.php?f=18&t=3040#p17720

q


Denkster wrote:Hi Polarbreeze,

Probably, an amateur moved the forum to the new platform.
While setting up the database and exporting-importing the database content, neglected to preserve character encoding of the database management system and data in the migration process.
Probably the DBMS renders UTF-8 coding as ISO-8xxx .

Sigh..

Eveline
q
 
Posts: 3525
Joined: Thu Sep 13, 2007 7:24 pm
Location: San Francisco, East Bay Area

Re: Something is wrong with forum text encoding

Postby Andre_B » Mon May 07, 2012 11:43 am

I thought first something had to be fixed "globally" so that the forum would accept the non-US characters, but - having done an append in French - it looks like the site or the forum perfectly accepts these, but that the data were once corrupted.
Every subscriber could edit and correct his own appends - though some of mine were done with a userid which I lost.
Andre_B
 
Posts: 92
Joined: Mon Oct 03, 2011 5:27 pm

Re: Something is wrong with forum text encoding

Postby Hotch » Mon May 07, 2012 2:02 pm

Hi,

We migrated to a new server and one of the problems seems to be the importing of the forum and text coding. I have our web developer and our hosting company looking into to a solution, but it may not be fixable without reverting back to the date we made the change. If that is the situation, we may "live with it".
Best regards,

Richard Hotchkiss
CEO, Gvox
103 Carnegie Center
Ste 300
Princeton NJ 08540

PO Box 7401
Princeton, NJ 08543
(732) 565-3842
(732) 284-3748 Fax

http://www.gvox.com

bring music to life
Hotch
 
Posts: 600
Joined: Sun Jul 29, 2007 4:56 pm
Location: Princeton, NJ

Re: Something is wrong with forum text encoding

Postby q » Sat May 26, 2012 4:58 pm

Denkster wrote:... neglected to preserve character encoding of the database management system and data in the migration process...

polarbreeze wrote:... at least they should correct the more visible errors, like the forum headings for example...

Seems like a simple text search and replace throughout the php database would handle much of this blemish and annoyance.

Here's what I've detected so far:

    ================================
    Â = "hard" non breaking space
     <--- exists here invisibly
    ASCII 202
    option space on Mac

    ================================
    — = em dash

    ASCII 209
    option-shift-dash on Mac

    ================================
    “ = smart opening double quote

    ASCII 210
    opt [ on mac

    ================================
    †= smart closing double quote

    ASCII 211
    opt shift [ on mac

    ================================
    For the record here's a smart opening single quote:

    ASCII 212
    option-] on Mac

    ================================
    ’ = smart closing single quote / smart apostrophe??

    ASCII 213
    option-shift-] on Mac

    ================================
    • = bullet

    ASCII 165
    option 8 on Mac

Doug, you may want to weigh in on the accuracy of the ASCII numbers and character names.

Of course this doesn't account for the diacritical marks and foreign language symbols that first brought this issue to light.

q
Last edited by q on Sat May 26, 2012 8:23 pm, edited 1 time in total.
q
 
Posts: 3525
Joined: Thu Sep 13, 2007 7:24 pm
Location: San Francisco, East Bay Area

Re: Something is wrong with forum text encoding

Postby Denkster » Sat May 26, 2012 7:58 pm

The fix for the correct rendering of the old content might be: the correct setting of one parameter..

That might however corrupt the new content.
The longer the forum administrator waits, the bigger the problem.

Search and replace?
The phpBB probably is based on MySQL, a really good open source DBMS!
One single query should be enough.

Of course the "somebody's little nephew" in charge of the forum lacks the required competence.

Eveline
Denkster
 
Posts: 3357
Joined: Fri Sep 14, 2007 1:28 pm
Location: The Netherlands, 's-Gravenhage

Re: Something is wrong with forum text encoding

Postby q » Sat May 26, 2012 8:25 pm

Denkster wrote:The fix for the correct rendering of the old content might be: the correct setting of one parameter...

That might however corrupt the new content.
The longer the forum administrator waits, the bigger the problem.

Of course the "somebody's little nephew" in charge of the forum lacks the required competence.

Eveline

Eveline,

These very thoughts had crossed my mind. Thanks for voicing them.

q
q
 
Posts: 3525
Joined: Thu Sep 13, 2007 7:24 pm
Location: San Francisco, East Bay Area

Re: Something is wrong with forum text encoding

Postby Hotch » Wed May 30, 2012 3:36 pm

The text encoding issue has been solved to the point where any new posts will be encoded correctly.
Best regards,

Richard Hotchkiss
CEO, Gvox
103 Carnegie Center
Ste 300
Princeton NJ 08540

PO Box 7401
Princeton, NJ 08543
(732) 565-3842
(732) 284-3748 Fax

http://www.gvox.com

bring music to life
Hotch
 
Posts: 600
Joined: Sun Jul 29, 2007 4:56 pm
Location: Princeton, NJ

Re: Something is wrong with forum text encoding

Postby Andre_B » Thu May 31, 2012 7:12 am

Hotch wrote:The text encoding issue has been solved to the point where any new posts will be encoded correctly.



Yes, that's what I understood from the test I talked of on May 7th (here above).

In addition, I noticed that the category in the Board Index "Encore en Français" - which was presented in the first append of this thread - was corrected. I think it was the only "wrong phrase" in a header.

Other occurences could be retyped by their initial writers, it they want to.
Andre_B
 
Posts: 92
Joined: Mon Oct 03, 2011 5:27 pm


Return to Encore en français

Who is online

Users browsing this forum: No registered users and 0 guests