<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii"><meta name=Generator content="Microsoft Word 14 (filtered medium)"><!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
{font-family:SimSun;
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:SimSun;
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:"\@SimSun";
panose-1:2 1 6 0 3 1 1 1 1 1;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p
{mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
code
{mso-style-priority:99;
font-family:"Courier New";}
p.MsoAcetate, li.MsoAcetate, div.MsoAcetate
{mso-style-priority:99;
mso-style-link:"Balloon Text Char";
margin:0cm;
margin-bottom:.0001pt;
font-size:8.0pt;
font-family:"Tahoma","sans-serif";}
p.ecxmsonormal, li.ecxmsonormal, div.ecxmsonormal
{mso-style-name:ecxmsonormal;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ecxmsoplaintext, li.ecxmsoplaintext, div.ecxmsoplaintext
{mso-style-name:ecxmsoplaintext;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ecxmsoacetate, li.ecxmsoacetate, div.ecxmsoacetate
{mso-style-name:ecxmsoacetate;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
p.ecxmsochpdefault, li.ecxmsochpdefault, div.ecxmsochpdefault
{mso-style-name:ecxmsochpdefault;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
span.ecxmsohyperlink
{mso-style-name:ecxmsohyperlink;}
span.ecxmsohyperlinkfollowed
{mso-style-name:ecxmsohyperlinkfollowed;}
span.ecxplaintextchar
{mso-style-name:ecxplaintextchar;}
span.ecxballoontextchar
{mso-style-name:ecxballoontextchar;}
p.ecxmsonormal1, li.ecxmsonormal1, div.ecxmsonormal1
{mso-style-name:ecxmsonormal1;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
span.ecxmsohyperlink1
{mso-style-name:ecxmsohyperlink1;
color:blue;
text-decoration:underline;}
span.ecxmsohyperlinkfollowed1
{mso-style-name:ecxmsohyperlinkfollowed1;
color:purple;
text-decoration:underline;}
p.ecxmsoplaintext1, li.ecxmsoplaintext1, div.ecxmsoplaintext1
{mso-style-name:ecxmsoplaintext1;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
p.ecxmsoacetate1, li.ecxmsoacetate1, div.ecxmsoacetate1
{mso-style-name:ecxmsoacetate1;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:8.0pt;
font-family:"Tahoma","sans-serif";}
span.ecxplaintextchar1
{mso-style-name:ecxplaintextchar1;
font-family:"Calibri","sans-serif";}
span.ecxballoontextchar1
{mso-style-name:ecxballoontextchar1;
font-family:"Tahoma","sans-serif";}
p.ecxmsochpdefault1, li.ecxmsochpdefault1, div.ecxmsochpdefault1
{mso-style-name:ecxmsochpdefault1;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Calibri","sans-serif";}
span.BalloonTextChar
{mso-style-name:"Balloon Text Char";
mso-style-priority:99;
mso-style-link:"Balloon Text";
font-family:"Tahoma","sans-serif";}
span.EmailStyle37
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-GB link=blue vlink=purple><div class=WordSection1><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>I’m pretty certain the problem is down to pasting in non UTF code.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>So how best to make sure it doesn’t get through to the PHP? Old Flash isn’t so good at this sort of thing. I already have event handlers looking of r a large change in size of the text, to detect pastes. I can do something there in the editor to strip out non-UTF8 I think?<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'>J<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D'><o:p> </o:p></span></p><div><div style='border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm'><p class=MsoNormal><b><span lang=EN-US style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>From:</span></b><span lang=EN-US style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'> xerte-dev-bounces@lists.nottingham.ac.uk [mailto:xerte-dev-bounces@lists.nottingham.ac.uk] <b>On Behalf Of </b>Dave Burnett<br><b>Sent:</b> 12 May 2014 14:15<br><b>To:</b> For Xerte technical developers<br><b>Subject:</b> [Xerte-dev] Re: FW: Further Xerte Data Loss<o:p></o:p></span></p></div></div><p class=MsoNormal><o:p> </o:p></p><div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><o:p> </o:p></span></p></div><div><p class=MsoNormal><span style='font-size:10.5pt;font-family:"Arial","sans-serif";background:white'>It's a </span><code><span style='font-size:10.5pt;font-family:Consolas;border:none windowtext 1.0pt;padding:0cm;background:#EEEEEE'>’</span></code><span style='font-size:10.5pt;font-family:"Arial","sans-serif";background:white'> (</span><span style='font-family:"Calibri","sans-serif"'><a href="http://www.fileformat.info/info/unicode/char/2019/index.htm"><code><span style='font-size:10.5pt;font-family:Consolas;color:#4A6B82;border:none windowtext 1.0pt;padding:0cm;background:#EEEEEE'>RIGHT SINGLE QUOTATION MARK</span></code></a></span><span style='font-size:10.5pt;font-family:"Arial","sans-serif";background:white'> - U+2019) character which has been encoded as </span><span style='font-family:"Calibri","sans-serif"'><a href="http://en.wikipedia.org/wiki/Windows-1252"><code><span style='font-size:10.5pt;font-family:Consolas;color:#4A6B82;border:none windowtext 1.0pt;padding:0cm;background:#EEEEEE;text-decoration:none'>CP-1252</span></code></a></span><span style='font-size:10.5pt;font-family:"Arial","sans-serif";background:white'> instead of </span><span style='font-family:"Calibri","sans-serif"'><a href="http://en.wikipedia.org/wiki/UTF-8"><code><span style='font-size:10.5pt;font-family:Consolas;color:#4A6B82;border:none windowtext 1.0pt;padding:0cm;background:#EEEEEE;text-decoration:none'>UTF-8</span></code></a></span><span style='font-size:10.5pt;font-family:"Arial","sans-serif";background:white'>.</span><span style='font-family:"Calibri","sans-serif"'><o:p></o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><o:p> </o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="http://stackoverflow.com/questions/2477452/a-showing-on-page-instead-of" target="_blank">http://stackoverflow.com/questions/2477452/a-showing-on-page-instead-of</a><o:p></o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><o:p> </o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>(and I assume the tilded version is the left single quote).<o:p></o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><o:p> </o:p></span></p></div><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p></div><p class=MsoNormal style='margin-bottom:12.0pt'><span style='font-family:"Calibri","sans-serif"'><o:p> </o:p></span></p><div><div class=MsoNormal align=center style='text-align:center'><span style='font-family:"Calibri","sans-serif"'><hr size=2 width="100%" align=center id=stopSpelling></span></div><p class=MsoNormal style='margin-bottom:12.0pt'><span style='font-family:"Calibri","sans-serif"'>From: <a href="mailto:Julian.Tenney@nottingham.ac.uk">Julian.Tenney@nottingham.ac.uk</a><br>To: <a href="mailto:xerte-dev@lists.nottingham.ac.uk">xerte-dev@lists.nottingham.ac.uk</a><br>Date: Mon, 12 May 2014 13:23:40 +0100<br>Subject: [Xerte-dev] Re: FW: Further Xerte Data Loss<o:p></o:p></span></p><div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>This looks suspicious: that’s not escaping, right? That looks to me like errant characters creeping in from somewhere? Paste from word, etc?<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><img border=0 width=611 height=432 id="ecxPicture_x0020_1" src="cid:image001.png@01CF6DED.076A2FB0"><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span lang=EN-US style='font-family:"Calibri","sans-serif"'>-----Original Message-----<br>From: <a href="mailto:xerte-dev-bounces@lists.nottingham.ac.uk">xerte-dev-bounces@lists.nottingham.ac.uk</a> [<a href="mailto:xerte-dev-bounces@lists.nottingham.ac.uk">mailto:xerte-dev-bounces@lists.nottingham.ac.uk</a>] On Behalf Of David Goodwin<br>Sent: 12 May 2014 12:53<br>To: For Xerte technical developers<br>Subject: [Xerte-dev] Re: FW: Further Xerte Data Loss</span><span style='font-family:"Calibri","sans-serif"'><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>On 12/05/14 12:41, Julian Tenney wrote:<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> What beats me is why the file should break halfway through a tag name.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>What I've seen is that the PHP (or whatever) can't read the XML file as it has the wrong encoding specified, or it just somehow contains characters that are invalid.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>see e.g.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="http://stackoverflow.com/questions/14463573/php-simplexml-load-file-invalid-character-error" target="_blank"><span style='color:windowtext;text-decoration:none'>http://stackoverflow.com/questions/14463573/php-simplexml-load-file-invalid-character-error</span></a><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="https://chrismckee.co.uk/saving-user-content-to-xml-error-contains-none-utf-8-content-aka-how-to-remove-invalid-characters-in-utf-8/" target="_blank"><span style='color:windowtext;text-decoration:none'>https://chrismckee.co.uk/saving-user-content-to-xml-error-contains-none-utf-8-content-aka-how-to-remove-invalid-characters-in-utf-8/</span></a><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>David.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> -----Original Message-----<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> From: <a href="mailto:xerte-dev-bounces@lists.nottingham.ac.uk"><span style='color:windowtext;text-decoration:none'>xerte-dev-bounces@lists.nottingham.ac.uk</span></a> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> [<a href="mailto:xerte-dev-bounces@lists.nottingham.ac.uk"><span style='color:windowtext;text-decoration:none'>mailto:xerte-dev-bounces@lists.nottingham.ac.uk</span></a>] On Behalf Of David <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> Goodwin<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> Sent: 12 May 2014 12:37<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> To: For Xerte technical developers<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> Subject: [Xerte-dev] Re: FW: Further Xerte Data Loss<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> Is it possible that it's a unicode ` character?<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> I've seen a number of UTF-8 related issues with the data.xml etc files.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> David.<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>--<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>Pale Purple Ltd<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>PHP Web application development and support<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="http://www.palepurple.co.uk" target="_blank"><span style='color:windowtext;text-decoration:none'>http://www.palepurple.co.uk</span></a><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>@PalePurpleLtd<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>07792 380669 / 0845 0046746<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'> <o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>_______________________________________________<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'>Xerte-dev mailing list<o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="mailto:Xerte-dev@lists.nottingham.ac.uk"><span style='color:windowtext;text-decoration:none'>Xerte-dev@lists.nottingham.ac.uk</span></a><o:p></o:p></span></p><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><a href="http://lists.nottingham.ac.uk/mailman/listinfo/xerte-dev" target="_blank"><span style='color:windowtext;text-decoration:none'>http://lists.nottingham.ac.uk/mailman/listinfo/xerte-dev</span></a><o:p></o:p></span></p></div><p class=MsoNormal><span style='font-family:"Calibri","sans-serif"'><br>_______________________________________________ Xerte-dev mailing list <a href="mailto:Xerte-dev@lists.nottingham.ac.uk">Xerte-dev@lists.nottingham.ac.uk</a> <a href="http://lists.nottingham.ac.uk/mailman/listinfo/xerte-dev">http://lists.nottingham.ac.uk/mailman/listinfo/xerte-dev</a><o:p></o:p></span></p></div></div></div></body></html>