author Francis Irving <francis@mysociety.org> 2010-09-30 17:47:25 +0100
committer Francis Irving <francis@mysociety.org> 2010-09-30 17:47:25 +0100
commit 4dc10e85aec60d10c2364e1171b392650bfd096e
tree 3adab80ff36bf69abb35808b2864a9e8b568ff04
parent 264f486cdb6198155d1982e017a6b4e665b4fe39
Software note.
-rw-r--r-- todo.txt 3
1 files changed, 3 insertions, 0 deletions
diff --git a/todo.txt b/todo.txt
index dfc6c3886..b12b1ec63 100644
--- a/todo.txt
+++ b/todo.txt
@@ -346,6 +346,9 @@ Failed to detect attachments are emails and decode them:
When indexing .docx do you need to index docProps/custom.xml and docProps/app.xml
as well as word/document.xml ? (thread on xapian-discuss does so)
+Consider using odt2txt or unoconv
+http://www-verimag.imag.fr/~moy/opendocument/
+
Mime type / extension wrong on these .docx's
http://www.whatdotheyknow.com/request/bridleway_classifications
http://www.whatdotheyknow.com/request/19976/response/51468/attach/3/TU%20MembershipTeachers%20SolidarityTU%20231009.docx.doc (thinks it is doc when it is docx)
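
The two .docx notes in this hunk (which parts to index, and attachments taken for .doc when they are really .docx) could be explored with something like the sketch below. It is not taken from the project's code: the function names `looks_like_docx` and `docx_text_by_part` are hypothetical, and it assumes only that a .docx is a zip archive containing word/document.xml, using the Python standard library.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Parts named in the todo note; whether the docProps ones are worth
# indexing is the open question, so this just reports what each contains.
DOCX_PARTS = ("word/document.xml", "docProps/app.xml", "docProps/custom.xml")


def looks_like_docx(data: bytes) -> bool:
    """Check content rather than extension or declared mime type.

    Assumption: a zip archive with a word/document.xml member is a .docx.
    An old binary .doc is not a zip, so it returns False here.
    """
    buf = io.BytesIO(data)
    if not zipfile.is_zipfile(buf):
        return False
    with zipfile.ZipFile(buf) as zf:
        return "word/document.xml" in zf.namelist()


def docx_text_by_part(data: bytes) -> dict:
    """Map each listed part that is present to its text content.

    Text nodes are joined with single spaces, which is crude: a word
    split across two runs would gain a spurious space.
    """
    out = {}
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        names = set(zf.namelist())
        for part in DOCX_PARTS:
            if part in names:
                root = ET.fromstring(zf.read(part))
                out[part] = " ".join(
                    t.strip() for t in root.itertext() if t.strip()
                )
    return out
```

Running `docx_text_by_part` over a few real attachments would show whether docProps/app.xml and docProps/custom.xml hold anything worth indexing beyond the body text.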