Modify

Opened 8 years ago

Closed 8 years ago

Last modified 7 years ago

#841 closed defect (invalid)

Crash when trying to render certain characters

Reported by: anonymous Owned by: athomas
Priority: normal Component: PageToPdfPlugin
Severity: normal Keywords:
Cc: mankoff+pdfplugin@… Trac Release: 0.10

Description

It seems that '´', '`', and the microsoft smart quotes, crash my instance of the plugin.

I have the following defined in trac.ini:

[pagetopdf]
size = A4
charset = iso-8859-15

The wiki has following text:

´

Python Traceback

Traceback (most recent call last):
  File "/usr/lib64/python2.4/site-packages/trac/web/main.py", line 313, in dispatch_request
    dispatcher.dispatch(req)
  File "/usr/lib64/python2.4/site-packages/trac/web/main.py", line 198, in dispatch
    resp = chosen_handler.process_request(req)
  File "/usr/lib64/python2.4/site-packages/trac/wiki/web_ui.py", line 126, in process_request
    page.text, format, page.name)
  File "/usr/lib64/python2.4/site-packages/trac/mimeview/api.py", line 550, in send_converted
    content, selector)
  File "/usr/lib64/python2.4/site-packages/trac/mimeview/api.py", line 330, in convert_content
    output = converter.convert_content(req, mimetype, content, ck)
  File "build/bdist.linux-x86_64/egg/pagetopdf/pagetopdf.py", line 20, in convert_content
  File "/usr/lib64/python2.4/encodings/iso8859_15.py", line 18, in encode
    return codecs.charmap_encode(input,errors,encoding_map)
UnicodeEncodeError: 'charmap' codec can't encode character u'\xb4' in position 4: character maps to <undefined>

The version of htmldoc is

htmldoc --version
1.8.24 Open Source

I am using the workflow branch from revision 3378.

Attachments (0)

Change History (4)

comment:1 Changed 8 years ago by athomas

  • Resolution set to invalid
  • Status changed from new to closed

According to this post, which may or may not be correct, the ACUTE ACCENT cannot be represented in ISO-8859-15 at all:

$ echo -en '\xc2\xb4' | iconv -f utf-8 -t iso-8859-15

iconv: illegal input sequence at position 0

You will have to change your codepage to something else, or use an alternate character.

comment:2 follow-up: Changed 8 years ago by mankoff

#1133 was marked as a dup of this, but the page linked to with this bug in #1133 does not have accent characters.

comment:3 in reply to: ↑ 2 Changed 7 years ago by anonymous

Replying to mankoff:

#1133 was marked as a dup of this, but the page linked to with this bug in #1133 does not have accent characters.

It's not the exact same character, but it's the same basic issue. The character '\u2019' is a RIGHT SINGLE QUOTATION MARK which is also unencodable in iso-8859-15.

comment:4 Changed 7 years ago by mankoff

  • Cc mankoff+pdfplugin@… added

OK I get it. But what is the solution? I've set my codepage to be something different than iso-8859-15 like so:

[pagetopdf]
size = A4
charset = UTF-8

but still get the same error.

Is there an easier fix than editing all pages with those stupid smart-quotes that got pasted in from somewhere else?

Add Comment

Modify Ticket

Action
as closed .
The resolution will be deleted. Next status will be 'reopened'.
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.