DHCrowdScribe: Building Scholarly Resources for Wider Public Engagement. A #DHCOxf Archive
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
'Building Scholarly Resources for Wider Public Engagement' was a full day workshop that took place at the Radcliffe Observatory Quarter, Oxford University, Oxford, on Friday 13 June 2014.
The hashtag for the event was #DHCOxf.
It was organised by DHCrowdscribe, the online hub for the output of the AHRC-funded Collaborative Skills Project ‘Promoting Interdisciplinary Engagement in the Digital Humanities’.
Speakers were: Matt Vitins and Anna Crowe ( Legal and Ethical Issues in the Digital Humanities), Dr Stuart Dunn (Crowdsourcing), Dr Robert Simpson (Zooniverse), Dr Ernesto Priego and Dr James Baker (Sharing Data from a Researcher’s Perspective), Michael Popham, Dr Ylva Berglund Prytz (Digitising the Humanities and Engaging with the Public), Judith Siefring (Early English Books Online Text Creation Partnership), David Tomkins (Bodleian Digital Library), Dr Robert Mcnamee (Electronic Enlightenment Project), Dr Stewart Brookes (‘Getting
Medieval, Getting Palaeography: The DigiPal Database of Anglo-Saxon Manuscripts), Dr Michael Athanson (ArcGIS and Mapping the Humanities) Professor David de Roure (Scholarly Social Machines), and Professor Howard Hotson.
This .XLS file contains Tweets tagged with #DHCOxf (case not sensitive).
The archive shared here contains 692 Tweets dated 13 June 2014 (the day the event took place). There were definitely more Tweets tagged #DHCOxf, but this was the closest I got to compiling a more or less complete set dated 13 June 2014.
The Tweets contained in this file were collected using Martin Hawksey’s TAGS 5.1. The file contains two sheets:
Sheet 0. The 'Cite Me' sheet, including procedence of the file, citation information, information about its contents, the methods employed and some context.
Sheet 1. The Archive containing 692 Tweets dated 13 June 2014.
To avoid spam only users with at least 2 followers were included in the archive. Retweets have been included.
Please note that both research and experience show that the Twitter search API isn't 100% reliable. Large tweet volumes affect the search collection process. The API might "over-represent the more central users", not offering "an accurate picture of peripheral activity" (González-Bailón, Sandra, et al. 2012). Therefore, it cannot be guaranteed this file contains each and every tweet tagged with #DHCOxf during the indicated period.
Some deduplication and refining has been performed to avoid spam tweets and duplication. Some characters in some Tweets' text might not have been decoded correctly.
Please note the data in this file is likely to require further refining and even deduplication. The data is shared as is. If you use or refer to this data in any way please cite and link back using the citation information above.
All the data collected in this small dataset was willingly made freely, openly and publicly available online by users via Twitter and therefore was and still is openly and freely available through several other methods and services. It has been shared here in a curated form for educational and research use and no copyright or privacy infringement is intended or should be inferred.
This file was created and shared by Ernesto Priego (Centre for Information Science, City University London) with a Creative Commons- Attribution license (CC-BY).
If you use or refer to this data in any way please cite and link back using the citation information above.
[Please make sure you are looking at the latest version of the file as earlier versions contained unfortunate typos].