[xep-support] Re: Encoding spaces at line ends and between words

From: Armin Günther <guenther_at_ADDRESS_REMOVED>
Date: Tue May 10 2016 - 02:06:20 PDT

Hi Fritz,

and thank you for your tests, very much appreciated! I was not sure if
we have a problem with our PDFs or if the problem is with the PDF
viewer. My conclusion so far is, that it is a problem of the viewer.
When I open the PDF with the build in Firefox PDF viewer and copy it to
notepad++ I get

====
Gender, in contrast, is a
social,
not a biological characteristic.
Gender
consists of what-
ever behaviors and attitudes a group considers proper for its males and
females. Conse-
quently, gender varies from one society to another. Whereas
sex
refers to male or female,
gender
refers to masculinity or femininity. In short, you inherit your sex, but
you learn
your gender as you are socialized into the behaviors and attitudes your
culture asserts are
appropriate for your sex.
====

So obviously when the lines are joined without adding a space at end of
line words get concatenated. And that's the problem we observe with PDF
annotation tools like hypothes.is

Thanks again and best wishes

Armin

PS. Sorry for posting my question twice - I thought that my first mail
was not delivered due to a not registered sender address.

Am 09.05.2016 um 12:50 schrieb Kirch Fritz:
> Hi,
>
> I opened the PDF-file with
> a) SumatraPDFViewer
> b) PDF-XChange-Viewer
> c) Acrobat Reader
> copied the text in into clipboard and pasted the text into a new notepad++ file
>
>
> Here are the results:
>
> a) SumatraPDFViewer:
>
> Gender, in contrast, is a social, not a biological characteristic. Gender consists of what-
> ever behaviors and attitudes a group considers proper for its males and females. Conse-
> quently, gender varies from one society to another. Whereas sex refers to male or female,
> gender refers to masculinity or femininity. In short, you inherit your sex, but you learn
> your gender as you are socialized into the behaviors and attitudes your culture asserts are
> appropriate for your sex.
>
> *************************************
>
> b) PDF-XChange-Viewer:
>
> Gender, in contrast, is a social, not a biological characteristic. Gender consists of what-
> ever behaviors and attitudes a group considers proper for its males and females. Conse-
> quently, gender varies from one society to another. Whereas sex refers to male or female,
> gender refers to masculinity or femininity. In short, you inherit your sex, but you learn
> your gender as you are socialized into the behaviors and attitudes your culture asserts are
> appropriate for your sex.
>
> **************************************
>
> c) Acrobat Reader:
>
> Gender, in contrast, is a social, not a biological characteristic. Gender consists of whatever
> behaviors and attitudes a group considers proper for its males and females. Consequently,
> gender varies from one society to another. Whereas sex refers to male or female,
> gender refers to masculinity or femininity. In short, you inherit your sex, but you learn
> your gender as you are socialized into the behaviors and attitudes your culture asserts are
> appropriate for your sex.
> ********************************************
> ********************************************
>
> In all cases I cannot see any problems relating to spaces.
>
>
> Kind regards
>
> Fritz Kirch
> Nürnberg
>
> -----Ursprüngliche Nachricht-----
> Von: Xep-support [mailto:xep-support-bounces@renderx.com] Im Auftrag von Armin Günther
> Gesendet: Montag, 2. Mai 2016 14:15
> An: 'RenderX Community Support List' <xep-support@renderx.com>
> Betreff: [xep-support] Re: Encoding spaces at line ends and between words
>
> As an addition to my previous post: Perhaps this mostly is not a XEP/PDF problem but a problem of the respective PDF-viewer?
>
> - Armin
>
>
>> Hi all,
>>
>> Is there a way to control how spaces at line ends and between words
>> with different styles are encoded into PDF? There seems to be no space
>> encoded at line ends and between words with different styles (e.g.
>> bold/italics/normal). When we use PDF annotation tools on our PDFs
>> generated by XEP (or simply want to copy text from PDFs generated by
>> XEP) words/lines are concatenated as in the following example taken
>> from a RenderX demo document
>> (http://www.renderx.com/files/demos/examples/CH11.pdf):
>>
>> Gender, in contrast, is a *social,not* a biological characteristic.
>> *Genderconsists* of *what-ever* behaviors and attitudes a group
>> considers proper for its males and females. *Conse-quently*, gender
>> varies from one society to another. Whereas *sexrefers* to male or
>> *female,genderrefers* to masculinity or femininity. In short, you
>> inherit your sex, but you *learnyour* gender as you are socialized
>> into the behaviors and attitudes your culture asserts *areappropriate*
>> for your sex.
>> *Text copied from http://www.renderx.com/files/demos/examples/CH11.pdf
>> with *missing spaces*
>>
>> Is there a way to have real spaces encoded and not just positioning of
>> text in PDFs here? The result should look like this text:
>>
>> Gender, in contrast, is a *social, not* a biological characteristic.
>> *Gender consists* of *what- ever* behaviors and attitudes a group
>> considers proper for its males and females. *Conse- quently*, gender
>> varies from one society to another. Whereas *sex refers* to male or
>> *female, gender refers* to masculinity or femininity. In short, you
>> inherit your sex, but you *learn your* gender as you are socialized
>> into the behaviors and attitudes your culture asserts *are
>> appropriate* for your sex.
>>
>> Thanks!
>> Armin
>>
>>
>
> _______________________________________________
> (*) To unsubscribe, please visit http://lists.renderx.com/mailman/options/xep-support
> (*) By using the Service, you expressly agree to these Terms of Service http://w ww.renderx.com/terms-of-service.html
>
> **********************************************************************
> Diese E-Mail wurde auf Viren ueberprueft.
> mailsweeper@it2media.de
> **********************************************************************
>
> _______________________________________________
> (*) To unsubscribe, please visit http://lists.renderx.com/mailman/options/xep-support
> (*) By using the Service, you expressly agree to these Terms of Service http://w
> ww.renderx.com/terms-of-service.html

-- 
ZPID - Leibniz-Zentrum für Psychologische Information und Dokumentation
ZPID - Leibniz-Institute for Psychology Information
Prof. Dr. Armin Günther
R & D
guenther@zpid.de
http://www.zpid.de/
Fon: +49(0)651-201-2055
Fax: +49(0)651-201-2604
_______________________________________________
(*) To unsubscribe, please visit http://lists.renderx.com/mailman/options/xep-support
(*) By using the Service, you expressly agree to these Terms of Service http://w
ww.renderx.com/terms-of-service.html
Received on Tue May 10 02:04:31 2016

This archive was generated by hypermail 2.1.8 : Tue May 10 2016 - 02:04:36 PDT