CUCC Expedition Handbook - Online systems

Expo Data Maintenance Manual

Expo data management programmers' manual

This page is not for cavers wanting to know how to record their cave survey data.
This page is not for cavers wanting to know how to type in logbooks or upload photographs.
This page is for programmers who are helping cavers do their thing and setting up their own laptop.

Editing the expo data management system is an adventure. Learning it by trial and error is non-trivial. There are lots of things we could improve about the system, and anyone with some computer nous is very welcome to muck in. It is slowly getting better organised.

This manual is organised in a how-to style: the sections correspond to tasks that a maintainer would want to carry out, rather than to specific components of the data management system.

Note that to display the survey data you will need a copy of the survex software.

Appendices:

Website history - a history of the data management system up to 2019
Taking Expo Bullshit into the 21st Century - initial report from 1996

Getting a username, password and key

You don't need a password to view most things, but you will need one to change them.

Use these credentials for access to the troggle site. The user is 'expo', with a cavey:beery password. Ask someone if this isn't enough of a clue for you. This password is important for security: the whole site will get hacked by spammers or worse if you are not careful with it. Use a secure method for passing it on to others who need to know (i.e. not unencrypted email), don't publish it anywhere, and don't check it in to the data management system by accident. A lot of people use it and changing it is a pain for everyone, so do take a bit of care.

This password is all you need to log in to troggle and to use the troggle control panel (very few people need to do this). But if you want to update webpages (a much more common requirement) or to edit the software itself (very rare), then you will also need to get a cryptographic key and register it with the server. See key exchange for details.

Unfortunately, pushing cave data to the ::loser:: and ::drawings:: repositories also needs a key. So cavers entering their cave survey data currently have to use a machine on which this is already set up. These machines are the expo laptop and the laptop 'aziraphale', which live in the potato hut during expo. If you want to use your own laptop then see below.

The repositories

All the expo data is contained in 4 "repositories" at expo.survex.com. This is currently hosted on a free virtual server we have blagged on a server farm. We use a distributed version control system (DVCS) to manage these repositories because this allows simultaneous collaborative editing and keeps track of all changes so we can roll back and have branches if needed.

The site has been split into four parts:

loser - the survex cave survey data
drawings - the tunnel and therion cave data and drawings
expoweb - the website pages, handbook, generation scripts
troggle - the database/software part of the survey data management system - see notes on troggle for further explanation

We have migrated two of these to git but the other two still use mercurial.

Mercurial Website Hack 2019

Currently (December 2019), after committing and pushing your changes to expoweb to the mercurial server, you will need to log in to expo.survex.com using ssh, cd to /expoweb/ and issue an "hg update" command so that the webserver notices your changes. This problem will go away before Expo 2020 - we hope - when we finish migrating from mercurial to git.

All the scans, photos, presentations, fat documents and videos are stored just as files (not in version control) in 'expofiles'. See below for details on that.

How the data management system works

Part of the data management system is static HTML, but quite a lot is generated by scripts and troggle (a web framework built using Django).

Examples of troggle-generated pages from data:

expo.survex.com/caves - list of caves surveyed and links to guidebook descriptions
expo.survex.com/pubs.htm - reports, accounts and logbooks
expo.survex.com/expedition/2018 - Members on expo 2018. Scroll down for a list of all the data typed in from survey trips.
expo.survex.com/survexfile/caves/ - List of caves with all the surveys done for each.
expo.survex.com/survexfile/caves-1623/115/cucc/futility.svx - Cave survey data from 1983 in Schnellzughohle.
expo.survex.com/survey_scans/ - List of all scanned original survey notes.
expo.survex.com/survey_scans/2018%252343/ - list of links to scanned notes for wallet #43 during the 2018 expo.
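
The last URL above looks odd because the wallet name contains a '#' character, which appears to have been percent-encoded twice. The following is a minimal sketch of that encoding. It assumes the wallet is named "2018#43" and that the name is simply quoted twice; the function name is hypothetical and this is not taken from the troggle source.

```python
from urllib.parse import quote, unquote


def wallet_url_segment(wallet_name: str) -> str:
    """Percent-encode a wallet name twice (assumed behaviour, not confirmed)."""
    once = quote(wallet_name, safe="")  # '#' becomes '%23'
    return quote(once, safe="")         # '%' becomes '%25'


segment = wallet_url_segment("2018#43")  # assumed wallet name
print("expo.survex.com/survey_scans/" + segment + "/")

# Decoding twice recovers the original wallet name.
print(unquote(unquote(segment)))
```

If you are building such a link by hand, it is probably easier to follow the link from the expo.survex.com/survey_scans/ index than to encode the name yourself.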

Anything you check in which affects cave data or descriptions won't appear on the site until the data management system update scripts are run. This happens automatically every 30 mins, but you can also kick off a manual update. See 'The expoweb-update script' below for details.

Also note that the ::expoweb:: web pages and cave data reports you see on the visible website are served from a separate copy on the webserver, not directly from the version-controlled "master" expoweb repo. So after you commit and push, your changes only become visible on the website once they have been 'pulled' from the repo onto the webserver.

Your own laptop

Setting up your own laptop so that it can do everything the expo laptop can do is quite a complicated process. At a minimum you will already be an experienced software nerd, will have git, mercurial and a text editor installed, and will know how to use them. You will also have done the key exchange process - which you can only do entirely on your own if you have access to the expo laptop.

See setting up your own laptop for the full list of software we use and where to get it.

Note that the instructions are primarily for people using Linux with some help for those using Windows. If you are a Mac user then you are on your own.

Using 'Edit This Page'

This can be used to edit web pages without installing any software or doing any key exchange. It even works if your laptop is a Mac.

This is the capability that you can see in the top-left-hand menu on any website page if you log in to troggle using the cavey:beery password.

'Edit This Page' is a troggle capability that edits the file served by the webserver, but it does not update the copy of the file in the repository (the inverse of the problem described above as 'Mercurial Website Hack'). To properly finish the job you need to:

ssh into expo@expo.survex.com (use putty on a Windows machine)
cd to the directory containing the repo you want, e.g. "cd loser" for cave data or "cd expoweb" for the handbook and visible data management system (the latter takes you to /home/expo/expoweb)
Then run "hg status" (to check what changes are pending),
then "hg diff" to see the changes in detail (or "hg diff|less" if you know how to use "less" or "more") and
then DO NOT just run 'hg commit' unless you know how emacs works, as it will dump you into an emacs editing window (C-x C-c is the way to exit emacs). Instead, do 'hg commit -m "found files left over - myName"', which supplies the obligatory comment with the commit operation.

Again, we hope that this issue will go away when we migrate the expoweb repo from mercurial to git before the 2020 Expo.

Quick start

If you know what you are doing here is the basic info on what's where:
(if you don't know what you're doing, skip to Editing the data management system below.)

This section is all about how to use mercurial. Since we are changing to git it has been removed to a separate place.

expofiles (all the big files and documents)

Editing the data management system

To edit the data management system fully, you need to use the distributed version control system (DVCS) software, which is currently mercurial/TortoiseHg. Some (static text) pages can be edited directly on-line using the 'Edit This Page' link, which you'll see if you are logged into troggle. In general the dynamically-generated pages, such as those describing caves which are generated from the cave survey data, cannot be edited in this way, but forms are provided for some types of these, like 'caves'.

[ui]
username = Firstname Lastname <myemail@example.com>

The commit has stored the changes in your local Mercurial DVCS, but it has not sent anything back to the server. To do that you need to:

hg push

Before pushing, you should do an hg pull to sync with upstream first. If someone else has edited the same files you may also need to do:

hg merge

(and sort out any conflicts if you've both edited the same file) before pushing again.
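
The order of operations described above can be sketched as a small Python helper. This is only an illustration, assuming that hg is on your PATH and that you have already committed your own changes; the function names and the merge commit message are hypothetical. Note that after an "hg merge" Mercurial expects the merge itself to be committed before you push.

```python
import shlex
import subprocess
from typing import List

MERGE_MESSAGE = "merge with upstream - myName"  # hypothetical commit message


def push_sequence(need_merge: bool) -> List[List[str]]:
    """Return the hg commands to run, in order, after a local commit."""
    commands = [["hg", "pull"]]  # sync with upstream first
    if need_merge:
        # Someone else has pushed changes: merge them and commit the merge.
        commands.append(["hg", "merge"])
        commands.append(["hg", "commit", "-m", MERGE_MESSAGE])
    commands.append(["hg", "push"])
    return commands


def run_all(commands: List[List[str]], repo_dir: str) -> None:
    """Run each command in repo_dir, stopping at the first failure."""
    for command in commands:
        subprocess.run(command, cwd=repo_dir, check=True)


# Print the sequence rather than running it, so nothing is changed by accident.
for command in push_sequence(need_merge=True):
    print(shlex.join(command))
```

Because run_all uses check=True, a merge that leaves unresolved conflicts stops the sequence before anything is pushed, which gives you the chance to sort out the conflicts by hand.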

Simple changes to static files will take effect immediately, but changes to dynamically-generated files (cave descriptions, QM lists etc.) will not take effect until the server runs the expoweb-update script.

The expoweb-update script

The script at the heart of the data management system update mechanism is a makefile that runs the various generation scripts. It is run every 15 minutes as a cron job (at 0, 15, 30 and 45 minutes past the hour), but if you want to force an update more quickly you can run it here.
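
If you just want to know how long you will have to wait for the next automatic update, the following minimal sketch works it out. It assumes the quarter-hour schedule stated above is the one currently in force and that your clock agrees with the server's; the function name is hypothetical.

```python
from datetime import datetime, timedelta


def next_update(now: datetime) -> datetime:
    """Return the next quarter-hour boundary strictly after `now`."""
    base = now.replace(second=0, microsecond=0)
    minutes_to_add = 15 - (base.minute % 15)  # always between 1 and 15
    return base + timedelta(minutes=minutes_to_add)


now = datetime.now()
wait = next_update(now) - now
print("Next scheduled update at", next_update(now).strftime("%H:%M"))
print("That is in about", int(wait.total_seconds() // 60) + 1, "minute(s)")
```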

The scripts are generally under the 'noinfo' section of the site, simply because that has (had) some access control. This will get changed to something more sensible at some point.

Updating cave pages

Cave description pages are automatically generated from a set of cave files in noinfo/cave_data/ and noinfo/entrance_data/. These files are named <area>-<id>.html (where area is 1623 or 1626) and are processed by troggle. After editing these files, run "python databaseReset.py caves" in /expofiles/troggle/ to update the site/database.

Clicking on 'New cave' (at the bottom of the cave index) lets you enter a new cave. Info on how to enter new caves has been split into its own page.

(If you remember something about CAVETAB2.CSV for editing caves, that was superseded in 2012).

This may be a useful reminder of what is in a survex file: how to create a survex file.

Updating expo year pages

Each year's expo has a documentation index, which is in the folder

/expoweb/years

To check out the 2011 page, for example, you would use

hg clone ssh://expo@expo.survex.com/expoweb/years/2011

Once you have pushed your changes to the repository you need to update the server's local copies, by logging in to the server with ssh and running hg update in the expoweb folder.

Adding a new year

Edit folk/folk.csv, adding the new year to the end of the header line as a new column. In that column put just a comma (blank cell) for people who weren't there, a 1 for people who were there, and a -1 for people who were there but didn't go caving. Add new lines for new people, with the right number of columns.

This process is tedious and error-prone and ripe for improvement. Adding a list of people from the bier book, with their aliases, would be a lot better, but we would also need some way to make sure that names match those from previous years.
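
As a starting point for such an improvement, here is a minimal sketch of adding a year column with Python's csv module. The layout is an assumption made for illustration: a header row, the person's name in the first column, and one column per year after that. The real folk.csv may have more columns, and the function name and sample names are hypothetical.

```python
import csv
import io
from typing import Dict, List


def add_year(rows: List[List[str]], year: str, attended: Dict[str, str]) -> List[List[str]]:
    """Append a year column; `attended` maps a name to "1" or "-1"."""
    header, people = rows[0], rows[1:]
    if year in header:
        raise ValueError("year " + year + " is already in the header")
    width = len(header)
    new_rows = [header + [year]]
    seen = set()
    for row in people:
        seen.add(row[0])
        # Blank cell for people who were not there this year.
        new_rows.append(row + [attended.get(row[0], "")])
    for name, flag in attended.items():
        if name not in seen:
            # New person: blank cells for every earlier column.
            new_rows.append([name] + [""] * (width - 1) + [flag])
    return new_rows


sample = "Name,2017,2018\nAnne Example,1,\nBob Example,,-1\n"
rows = list(csv.reader(io.StringIO(sample)))
updated = add_year(rows, "2019", {"Anne Example": "1", "Carol Example": "-1"})

out = io.StringIO()
csv.writer(out, lineterminator="\n").writerows(updated)
print(out.getvalue())
```

This only matches names exactly, so it does nothing about the alias problem mentioned above: a person entered under a slightly different spelling would end up with a second line.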

Ticking off QMs

To be written.

Maintaining the survey status table

There is a table in the survey book which has a list of all the surveys and whether or not they have been drawn up, and some other info.

This is generated by the script tablizebyname-csv.pl from the input file Surveys.csv