User:Kerry Raymond/QHR

The Queenland Heritage Register is available in two main formats:


 * Website: https://heritage-register.ehp.qld.gov.au/ (Copyright)
 * XML: https://data.qld.gov.au/dataset/the-queensland-heritage-register/resource/43ff6bb2-8dfb-4d87-90ab-b89cb20c7ef7 (CC-BY)

There are 1692 entries, either as individual web pages (HTML) or as place tags within the XML file.

Following are the fields of Queensland Heritage Register as they appear in the XML and web versions. The XML is a subset of the website content, but is CC-BY licensed and contains the narrative content (history and description) which is the main concern for copyright considerations. However, the remaining fields on the website are of a "fact" character and therefore cannot be copyright and could be incorporated by web-scraping.

Site ID
https://heritage-register.ehp.qld.gov.au/placeDetail.html?siteId=22 
 * The siteID parameter in the URL of the web entry (it does not appear on the web page itself), e.g.
 * The numeric id attribute of the place tag in the XML file

This field is not part of the QHR entry and is an artefact of the current database used to hold the register. Therefore, this field is not guaranteed to be persistent, but is currently used as the key field for many practical purposes.

There is only one value for this field per entry and it is unique within the QHR.

Place ID
Place ID     602764 602764
 * The Place ID field on the website, seen in HTML as:
 * The numeric content within the place_ref tag within the place tag in the XML file, e.g.

There is only one value for this field per entry and it is unique.

Registration Type
The primary classification for heritage classification. Can be one of three values: "State Heritage" (1676 entries), "Archaeological" (14 entries) and "Protected Area" (2 entries)

Registration Type State Heritage   
 * Registration Type field on the website:
 * type attribute of the place tag in XML

There is only one value for this field per entry.

Place name
The natural language name of the entry:

Place Name Kingaroy Peanut Silos Kingaroy Peanut Silos
 * The Place Name on the website:
 * in the XML

Cardinality: There is only one value for this field per entry and it is mostly unique within the heritage but some names are not unique (names and cardinalities below): 6        Queensland National Bank (former) 3        St Patricks Church 3        Residence 3        National Australia Bank 3        Criterion Hotel 2        Woodlands 2        Tree of Knowledge 2        St Marks Anglican Church 2        St John's Church 2        St George's Anglican Church 2        St Brigids Church 2        St Andrews Uniting Church 2        Queen's Park 2        Queens Gardens 2        Holy Trinity Church 2        Holy Trinity Anglican Church 2        Grand Hotel 2        First World War Honour Board 2        Bishop's House 2        Bank of New South Wales (former) 2        Australian Joint Stock Bank (former) 2        All Saints Anglican Church

How the name is chosen is a mystery to me. Sometimes it is an historic name, sometimes it is the name at the time of registration, sometimes it is the name of the owner/occupier at the time of registration, sometimes it is a common name, sometimes it is a street address, sometimes it is the usage of the site. In many instances the place name includes "Former" or "(former)" reflecting that the name in the heritage register is not the current name.

Transformation: Place name is the most likely field to use for the WP article title. In most cases it is sufficiently descriptive and unambiguous for use, e.g. "Kingaroy Peanut Silos". In other cases it is sufficiently descriptive for an article title but is ambiguous within WP, e.g. "St James Cathedral"; in these cases, disambiguation by appending the suburb/locality/town will usually suffice. Street address names are not ideal as article titles and any alternative names should be considered; failing that, street address names may need to have subuurb/locality/town appended either to be more descriptive or to disambiguate. Finally, there are names like "Residence" which are not sufficiently descriptive and likely to be ambiguous, and the use of alternative names (if any) should be considered; appending street address would be a fallback. The selection of article titles probably has to be manual given all these considerations, but tools can be used to identify those names where there is an existing WP article or WP disambiguation which will influence the choices. There is a need to track the article title chosen as some cross-referencing occurs (usually with the History and Description fields) which would need to be rendered as wikilinks.

Generally "former" parts of names are probably not appropriate for Wikipedia article titles and would more likely be captured in the lede para, e.g. "The Queensland National Bank is a heritage-listed former bank ...". On the other hand the use of "old" in a place name generally indicates the common name and would normally be included in the article title "Old Government House".

Alternate name
Alternate names are present for heritage places with multiple names for whatever reason. There are 1240 alternate names in the QHR. Alternative Name Klondyke Beehive Coke Ovens Klondyke Coking Ovens
 * in HTML


 * in XML

Klondyke Beehive Coke Ovens Klondyke Coking Ovens

Cardinality: There can be 0, 1, or more alternative names per entry. Like place names, alternative names are mostly unique within the QHR but not always. Some alternative names are the places names of other QHR sites.

Transformation: Alternate names may be preferable to the place name as article titles in some circumstances. Place names or alternate names not used as article titles might appear in the lede paragraph and might be established as redirects. However, in some cases, the alternate names are only minor variations of place names (e.g. "Klondyke Coke Ovens", "Klondyke Beehive Coe Ovens", "Klondyke Coking Ovens") and might not warrant mention in the lede nor redirection. However, in other cases, the names are very different: "99 Grafton St", "Former Cairns Chinatown", "Ruth Women's Bookshop", "Andrew's Barber Shop" and would require explicit mention in the lede and as redirects. Note that any redirects might also have issues of ambiguity either with QHR articles (some alternate names are the place names of other heritage sites) or with other Wikipedia articles more generally.

Place Classification

 * not present in the XML
 * not present in the XML