to do updated

This commit is contained in:
Philip Sargent 2022-03-24 01:16:43 +00:00
parent be410d4d9d
commit a514355e5e

View File

@ -26,17 +26,14 @@ Parses and imports logbooks in all their wonderful confusion
todo='''
- Put the object store 'trips' and the 'logdataissues' into TROG global object
- works parsing logbooks but when reading cache files fails on storing data
and is slower than parsing from scratch now!
- Use the .shelve.db cache for all logbooks, not just individually
- refactor everything with some urgency, esp. LoadLogbookForExpedition()
-- far too many uses of Django field dereferencing to get values, which is SLOW
- Logbooks log.htm exist for 1983, 84, 85, 87, 88, 89 but have no full-working parser,
or it is broken/incomplete and need hand-editing.
- Loogbooks 1987, 1988, 1989 all crash on MySql - but not sqlite - with db constraint fail
- Loogbooks 1987, 1988, 1989 all crash on MySql - but not sqlite - with db constraint fail. Edit logbook to fix.
- import/parse/re-export-as-html the 'artisanal-format' old logbooks so that
we keep only a modern HTML05 format. Then we can retiure the old parsers and reduce the
@ -303,7 +300,7 @@ def Parseloghtmltxt(year, expedition, txt):
for trippara in tripparas:
logbook_entry_count += 1
tid = set_trip_id(year,logbook_entry_count)
print(f' - new tid:{tid} lbe count: {logbook_entry_count}')
# print(f' - new tid:{tid} lbe count: {logbook_entry_count}')
s = re.match(r'''(?x)(?:\s*<div\sclass="tripdate"\sid=".*?">.*?</div>\s*<p>)? # second date
\s*(?:<a\s+id="(.*?)"\s*/>\s*</a>)?