Ticket #2143 (closed defect: fixed)

Opened 4 years ago

Last modified 1 year ago

[Patch] Indexer _strip wrong regexp (non-ascii char issue)

Reported by: slach@74mail.ru Assigned to: rjollos
Priority: normal Component: RepoSearchPlugin
Severity: normal Keywords:
Cc: Trac Release: 0.10

Description

indexer.py line 193

_strip = re.compile(r'\w+')

must be replace _strip = re.compile('[\w_]+',re.U)

Attachments

Change History

11/11/07 06:20:26 changed by athomas

  • status changed from new to closed.
  • resolution set to invalid.

_ is already included by \w.

11/14/07 16:23:12 changed by anonymous

  • status changed from closed to reopened.
  • resolution deleted.

when not using re.U, non-ascii character not indexed

02/07/10 08:55:39 changed by rjollos

Reassigning ticket after changing maintainer of plugin.

02/07/10 08:56:27 changed by rjollos

  • owner changed from athomas to rjollos.
  • status changed from reopened to new.

02/07/10 10:08:37 changed by rjollos

  • summary changed from Indexer _strip wrong regexp to Indexer _strip wrong regexp (non-ascii char issue).

There seem to be numerous reports related to us of non-ascii characters. I will test out your fix, but could really use someone to test on their non-english installation.

02/07/10 10:14:24 changed by rjollos

  • summary changed from Indexer _strip wrong regexp (non-ascii char issue) to [Patch] Indexer _strip wrong regexp (non-ascii char issue).

08/30/10 07:34:49 changed by rjollos

  • status changed from new to closed.
  • resolution set to fixed.

(In [8523]) Fixed problem with indexing non-ascii characters. Fixes #2143.


Add/Change #2143 ([Patch] Indexer _strip wrong regexp (non-ascii char issue))




Change Properties
Action