Category Archives: Activity

Online Enthusiast Communities in the UK Web Archive

There is a saying that ‘variety is the spice of life’ and this is certainly true when you think of the types of hobbies and interests the UK public engages in. There are the hobbies we have all probably heard of, such as train spotting or metal detecting, and there are the more obscure ones such as Poohsticks or Hand Dryer appreciation. Websites are a useful tool for enthusiasts to communicate and share their passion with the world. At the UK Web Archive (UKWA) the Online Enthusiast Communities collection aims to:

‘Capture how UK based public forums are used to discuss hobbies and activities and serve as a place for enthusiasts to converse with others sharing similar interests.’

This collection includes such a diverse and wonderful selection of websites and forums. I can honestly say that curating this collection has truly been a joy – there are probably very few jobs that allow you to look at The Letter Box Study Group (a website about the history and development of British roadside letter boxes) as part of your tasks for the day.

Differences I have noticed

As a curator you get to explore lots of sites and you begin to notice differences and similarities between websites. It is interesting to see the variety in website design and levels of expertise and to me it feels like this is reflected in the websites that are archived.

I have noticed lots of online communities using a variety of website builders. The huge diversity in tools appears to have made it easier to create more professional-looking sites. Compared to older sites, you notice:

  • the increased use of images
  • cleaner feel
  • neutral backgrounds
  • minimal text
  • occasional e-commerce sections

However, it is nostalgic to see some of the older, more ‘blocky’ sites, as I do remember the days of dial-up internet access and early websites. To me, forums tend to have a similar feel and their designs do not deviate greatly from each other.

I have also found it intriguing how often a website updates. Some are regularly updated whereas others appear to have been untouched for several years. This may reflect that many websites are run by volunteers balancing other commitments. Regularity of updates is an important factor as it helps decide how often we capture the site. Judging this is part of a web archivist’s skill, although capture frequencies can be changed later.

Some of my Favourite sites

One of the joys of curating this collection is that you get to experience sites that are really unique that you would not normally explore. I wanted to highlight a few of the sites that particularly caught my attention, specifically from the ‘Miscellaneous’ sub section as this is my personal favourite.

Pylon of the Month

Pylon of the month (February 2018) from Sweden. Image Credit: Kristin Allardh, 2018

This is a site dedicated to electricity pylons, highlighting a monthly winner. These can include current pylons or historic images, and entries can come from the UK and beyond. Images are usually accompanied by some interesting history or facts.

Modernist Britain

Odeon cinema Leicester, Leicestershire. Image Credit: Richard Coltman, 2010

This site is beautifully designed and celebrates modernist architecture in Britain. There are fifty illustrated images with accompanying information about the history of the buildings and photographs taken by Richard Coltman.

Cloud appreciation society

A Lenticular cloud. Image Credit: © José Ramón Sáez, 2019

This site was launched in 2005 with the aim of ‘bringing together people who love the sky’. It has an international membership with members submitting images from all over the world. They also run events and publish cloud-related news, and in 2019 they are contributing to the non-profit FogQuest project.

The online enthusiast community is also very witty: there are some fantastically named sites and forums such as:

  • Planet of the Vapes – a forum about vaping
  • DIYnot Forum – a forum about DIY
  • Frit-Happens! – an online community for glass blowing and glass crafting

Curating the online enthusiast collection has been incredibly enjoyable. Having to actively seek new sites has made me more aware of the variety of hobbies and diversity of interests the public engage in.

As this collection develops, more sites relating to the variety of hobbies and interests will be captured and preserved for future generations to explore, enjoy and research. However, due to the size, complexity and technological challenges of archiving all UK websites, some may get missed or we just do not know about them. If there is a site that you think should be included then you can nominate it on the ‘Save a UK website’ page of the UKWA.

Opening the Edgeworth Papers

The Bodleian Libraries hold a rich and varied collection of papers related to the Edgeworth family from the 17th to the 19th century. Only a tiny percentage of the material contained therein is available in print and even less has been subject to scholarly editing.

The collection may be little known, but it is of great significance, providing vital evidence (manuscript drafts and correspondence) about the literary career of one of the most important novelists of the early 19th century, Maria Edgeworth (1768-1849). Maria’s work is also placed in context by additional documentation that covers the educational, agricultural and political theory and practice of her father, the politician, writer and inventor Richard Lovell Edgeworth (1744-1817).

Engraving of Maria Edgeworth (MS. Eng. misc. c. 901, fol. 28).

Through assorted written material, the collection shows the ways in which an extended family with connections in Ireland, England, France and India, communicated and collaborated in the production of art, literature, and scientific knowledge. And it sheds light on Anglo-Irish relations during a period of political contestation and transformation.

Over the next 12 months we will investigate ways of raising the profile of this collection through social media and through scholarly and digital editing. The project takes one selection of the material in the Edgeworth papers – correspondence and other evidence related to the year 1819-1820 – and tracks it alongside 2019-2020, a momentous period in the history of the relations between Britain and Europe. Each month, our blog will present sample documents from the same month 200 years earlier. Writing in March 2019, as the UK faces huge political upheaval, let us introduce you to Maria and her family, who in March 1819 are in the midst of a personal – rather than political – challenge on both sides of the Irish Sea.

Love and Marriage: A Family Affair

As the old song says, love and marriage go together like a horse and carriage. But in the early 19th century, ‘love’ wasn’t the key concern. The idea of the ‘marriage market’ brings home the financial considerations of matrimony in the period. For women, this was particularly acute. The financial and legal implications of an imprudent marriage were serious – it was, after all, impossible to get a divorce without first obtaining a private Act of Parliament.

It is no wonder families were so invested in securing the right matches for their children – and no surprise that so many novels dramatised the intrigues, concerns and implications of the marriage market in the ‘courtship plot’. Lady Russell in Jane Austen’s Persuasion (1818), for example, convinces heroine Anne Elliot not to marry the nobody Frederick Wentworth as this would present too much of a social risk. When Wentworth returns a Captain, Lady Russell’s opposition comes across as snobbish and intrusive. In the context of 19th-century marriage laws and women’s rights, Lady Russell’s concern is sincere. Today marriage comes under the umbrella of ‘personal relationships’, but 200 years ago matrimony was very much a family affair.

In March 1819, bestselling novelist Maria Edgeworth was embroiled in her own family affair that could have come straight from a novel like Persuasion. Her young half-sister, Fanny, some 30 years Maria’s junior, was being courted by a man whose morals her family admired but whose personality they considered rather dull: the ‘Mr. L.W.’ [Lestock Wilson] of 31 Harley Street. Fanny, Maria and another half-sister, Honora (1791-1858), who was only eight years older than Fanny, were visiting London together. Maria hurriedly wrote home to Edgeworthstown, Ireland, to her step-mother and Fanny’s mother, Frances Ann Beaufort (1769–1865), her ‘dearest mother’ (in fact one year younger than Maria herself), to discuss what to do. Believing Mr LW to be unsuitable, Maria sought to dazzle Fanny by opening the country-educated girl to the best of London society. She had herself refused a proposal of marriage in 1802 from the Swedish intellectual Abraham Niclas Clewberg-Edelcrantz (1754-1821), whom she met on a family visit to Paris, lacking the confidence to leave the family she loved so dearly for an uncertain union.

Drawing of Fanny Edgeworth as a young child by her mother Frances Edgeworth (MS. Eng. misc. c. 901, fol. 8).

Drawing of Fanny Edgeworth as a young adult by her mother Frances Edgeworth (MS. Eng. misc. c. 901, fol. 9).

The urgent tone of this letter bespeaks the need to act quickly and decisively. Both Maria and Frances are wary of Fanny accepting the invitation to Mr LW’s house, though she was desirous to ‘see & judge for herself’. Despite LW’s protestations that ‘he would not behave to her as a lover or pay her any peculiar attention’, such a visit would be ill-advised: as Maria contends, it would be neither ‘prudent’ nor ‘proper’.

Letter from Maria Edgeworth to Frances Edgeworth (MS. Eng. lett. d. 696, fol. 146r).

Letter from Maria Edgeworth to Frances Edgeworth (MS. Eng. lett. d. 696, fol. 146v).

Letter from Maria Edgeworth to Frances Edgeworth (MS. Eng. lett. d. 696, fol. 147r).

Letter from Maria Edgeworth to Frances Edgeworth (MS. Eng. lett. d. 696, fol. 147v).

Maria’s concern is that a strong romantic inclination may not be sufficient to ‘secure Fanny’s permanent happiness’. Admittedly, Maria does not relish her ‘Duenna’ (chaperone) role, but writes that ‘this is to me as a feather in the balance compared with the object in view’.  Convinced of Mr LW’s unsuitability, the Edgeworths sought to protect Fanny from a marriage that she wouldn’t be able to leave. The following month, Fanny refused him – but she regretted and mourned her decision, accepting his renewed proposal some ten years later.

This letter also gives us an insight into the complex generational dynamics of the Edgeworth family. Maria’s father, Richard Lovell Edgeworth, married four times and had 22 children. Richard’s death two years before these events left Maria and his fourth wife, Frances, to direct the family drama. Maria takes on her father’s mantle (she’d had early experience in helping him manage his family estate) and adopts a paternal role in agreeing with Fanny’s mother the best way forward.

Transcript of letter:

Dearest mother   On our return from breakfasting with M.rs Marcet (where we met M.r Mallet) our packet of letters was put into our hands & we ran to our own fireside to devour the con – Lest I should not have time to say more let me make sure of the most important thing I have to say. That I entirely agree with you that it would neither be prudent in her present cir=cumstances nor proper in the eyes of the world for Fanny to part company from me to go even for a few days alone to 31 Harley St.t – This having been my opinion before I knew it was yours and being streng=thened by the decided expressions In your letter to Fanny just rec.d I have advised her by no means to go there alone till at least till we hear again from you – She will or has told you  what passed between M.r L W and her yesterday morning – in consequence of his promise that if she were in the house with him he would not behave to her as a lover or pay her any peculiar attention she wished to spend some days at Harley S.t without Honora or me that she might see & judge for herself.

When I told her my reasons against this – & in particular stated repeated to her the advice my father gave me not to trust myself alone with a man in whose favor my inclinations spoke more than my judgment Fanny most prudently & kindly has yielded to me her wish & says she is quite convinced by my reasons & therefore was unwilling to write to ask your opinion further – that is to ask you whether in consequence of [what] has since passed between her & L W the circumstances are so far altered that you would advise her to go there by her=self – They have but one small spare room & therefore F — ^anny^ says cannot ask us to be with her but that objection c.d I think be easily waived for I don’t care into what space I am crammed – I can sleep in the bed with her – Honora could for a week & would I am sure go to Sneyd – We cannot all have at every moment what is most agreeable But Honora I am sure would be as willing as I am to do what may not be agreeable for the time to secure Fanny’s permannent happiness – You may guess how disagreeable it will be to thrust myself into a house Duenna=ways – the maiden’s steps to haunt & in society that cannot relish me at any time – but [xxx] this is to me as a feather In the balance compared with the object in view –

I advise that she should remain with me to the end of  the fortnight at Lady E W’s – that she sh.d dine then go with me to M.rs Carr’s Hampstead or M.rs Baillie’s or wherever we next deter=mine to go for another week or so – and then if the Wilsons ask me to go with her to Harley S.t I am ready to go if you approve & to stay as long or as short a time as Fanny wishes.

Answer me very distinctly and decidedly my dearest friend these Questions Do you approve of my going with F to 31 Harley S.t to stay some time – or Do you approve or not of Fanny’s going there by herself – I cannot write or think on any other subject at present

truly affectionately yrs,

Maria E

The blended Edgeworth clan – consisting of several step-mothers and numerous half-siblings – provided a whole series of domestic dramas, revealing surprising alliances, deep loyalties and often lively comedy. Over the next 12 months we look forward to opening the Edgeworth papers, uncovering their stories, and sharing them with you.

Opening the Edgeworth Papers: the team

Ros Ballaster, Professor of Eighteenth-Century Studies, Faculty of English and Mansfield College, University of Oxford

Catriona Cannon, Deputy Librarian and Keeper of Collections, Bodleian Library

Anna Senkiw, Research Assistant

Ben Wilkinson-Turnbull, Research Assistant

Follow us on Twitter @EdgeworthPapers

Developing collections on Gender Equality at the UK Web Archive

The Gender Equality collection

The UK Web Archive Gender Equality collection and its themed subsections provide a rich insight into attitudes and approaches towards gender equality in contemporary UK society and culture. I discussed this in my last blog post about the collection, which you can read here.

Curating the collection

A great deal of the discussion and activity relating to gender equality occurs predominantly in an online space. This means that as a curator for the Gender Equality collection, the harvest is plenty! The type of content being collected by the UK Web Archive includes:

Of course there is some crossover, not only regarding the type of content but also within subsections of the gender equality collection.

This image is made available and reproduced under CC-BY-NC-SA 2.0. [https://creativecommons.org/licenses/by-nc-sa/2.0/legalcode]

Specifically, I find the event sites in the collection really interesting. As well as documenting that the events existed and happened in the first place, they can give us a snapshot of who organised each event, as well as who the intended audience was. The collection also shows the evolution of websites related to gender equality over time (which can be very speedy indeed when it comes to sites like Twitter accounts!), and the changing priorities, trends, initiatives and more that can tell us about attitudes towards gender equality in the UK. These kinds of websites are being created and engaged with by people right now.

Nominate a website!

The endeavour of the UK Web Archive never stops – if you would like to help grow the Gender Equality collection (or indeed, any other collection) click here to nominate a website to save. Go on… whilst you’re at it, you can explore the UK Web Archive’s funky new interface!

Image reference: Workers Solidarity Movement (2012) March for Choice

Festivals in the UK Web Archive

Live events are funny things; can their spirit be captured or do you have to “be there to get it”? Personally I don’t think it can, so why are we archiving festival websites?

Running throughout the year, though most tend to be clustered around the short UK summer, festivals form a huge part of the UK’s contemporary cultural scene. While it’s often the big music festivals that come to mind, such as Glastonbury and Reading, or perhaps the more local CAMRA-sponsored beer and cider festivals, these days there is a festival for pretty much everything under the sun.

UK Web Archive topics and themes

In part this explosion of festivals from the very local and niche to the mainstream and brand sponsored has been helped by the internet. You can now find festivals dedicated to anything from bird watching to meat grilling to vintage motors.

With the number of tools and platforms available for website creation and event and bookings management and the rise of social media, it seems anyone with an idea can put on a festival. More importantly with increasing connectedness that the web gives us, the reach of these home grown festivals has become potentially global.

Of course most will remain small local events that go on until the organisers lose interest or money, such as Blissfields in Winchester, which had to cancel its 2018 event due to poor ticket sales. But some will make it big, like Neverworld, which started in 2006 in Lee Denny’s back garden while his parents were away for the week but now, 10+ years on, has sold out the 5000-capacity festival venue it has relocated to.

The UK Web Archive’s Festivals collection attempts to capture the huge variety of UK festivals taking place each year. It currently has around 1200 events being archived, loosely categorised around 15 common themes, though of course there is a great deal of crossover as festivals can be found combining themes such as:

In this collection of UK festival sites, while we cannot capture the spirit of a live event, we can still try to capture its transient nature. Here you can see festivals rise and fall, the photographs and comments left in their wake, and their impact on local communities over time. Hopefully these sites and their contents can still give future researchers a sometimes surprising and often candid snapshot of contemporary British culture.

Emily Chen

Wilfred Owen Archive: New catalogue

The Wilfred Owen archive has just been fully rehoused and catalogued, with a detailed list of items available online. The collection has had a lively existence thus far, with the bulk of it donated by Harold Owen in 1975 to the English Faculty Library. Wilfred’s cousin Leslie Gunston donated the Gunston collection in 1978. Small additions have been made since then, and the collection now includes the working papers and correspondence of two prominent Owen scholars, Dominic Hibberd and Jon Stallworthy. The entire collection was transferred to the Weston Library on 13 January 2016.

Following a month of work, the collection has been reordered and renumbered, although the former, widely-cited OEF (Oxford English Faculty) references are included in the catalogue, as are references to Jon Stallworthy’s transcripts in Wilfred Owen: The Complete Poems and Fragments (CPF).

Wilfred Owen’s literary papers make up the first six boxes (MSS. 12282/1-6) and include Wilfred’s original manuscripts (digital versions of which are available on the World War I Poetry Digital Archive), allowing the reader to see the maturation of Owen’s poetry from the early ‘To Poesy’ to his masterpieces ‘Dulce et Decorum Est’, ‘Mental Cases’ and ‘Anthem for Dead Youth’. Drafts of poems that Wilfred sent to his cousin, Leslie Gunston, are also found in this part of the collection.

The archive also contains other primary source material relating to Wilfred. At MSS. 12282/34-5 there are original editions of The Hydra, a magazine published by the patients at Craiglockhart Hospital for Neurasthenic Officers where Wilfred was a patient in 1917. He edited several issues of the magazine and some of the copies have annotations by him, such as ‘With the Editor’s Compliments!’ School exercise books and correspondence are similarly preserved, and there is an extensive collection of objects and family possessions relating to Wilfred and his family. Many of the objects are extremely fragile and kept in a Reserved part of the collection, but they provide a tangible closeness to Wilfred. Found here are some of Tom Owen’s souvenirs from India, Susan Owen’s jewellery box, with locks of Wilfred’s baby hair, an old family clock, a boat handmade by Tom for Wilfred, and some binoculars belonging to Wilfred himself.

The photographs in the archive span from the late 19th century to the late 20th century, and include many generations of Wilfred’s mother’s family. The photos are arranged by size and subject and include photographs of Wilfred.

The remainder of the archive mostly consists of Harold Owen’s correspondence, press cuttings and working papers. These offer a fascinating insight into the life of Wilfred’s brother, Harold, and highlight the way in which he controlled Wilfred’s reputation and that of the Owen family. His correspondence with admirers, scholars, publishers, libraries and museums uncovers the human face of archival acquisitions and posthumous literary fame. Harold’s memoir, Journey from Obscurity, is found in this part of the collection, with a first draft of almost 1000 pages written by hand in Harold’s characteristic small capitals.

There are three later additions to the archive. The 1978 Gunston donation includes manuscripts dating back to the 19th century, letters, photographs and cartoons. Particularly charming are Leslie’s letters to his wife Norah, and the sketches contained in them.

The Owen scholar Dominic Hibberd gave his working files, which contain correspondence, press cuttings, photocopies and photographs, generated in the course of his research. Some of these items are dated as recently as 2002, and include new resources, such as photocopies of the birth, death and marriage certificates of Wilfred’s extended family.

Also present are Jon Stallworthy’s working files, which consist mostly of photocopies of the Owen manuscripts that he used to create his Complete Poems and Fragments.

Several items in particular caught my attention throughout the archiving process:

Items 83 and 102 in MS. 12282/7, folder 2 are two letters from Annie G Phillips to Harold Owen, dated November 1969. Annie is studying for her A levels, and writes to Harold of her admiration for Journey from Obscurity, his memoirs. She says that learning about the family life of the Owens has helped her understand Wilfred’s poetry on a deeper level, but she also makes some very personal connections. Like Wilfred, she cannot afford to go to university. Harold’s reply must have been kind because her follow-up letter is even more brimming with excitement. These exchanges really position Harold as a living connection to Wilfred, a way for readers to access the poet, a way of keeping Wilfred alive. But this is of course exactly what Harold’s archival work did and does. His own papers are testimony to that process of preservation, and exist as items worthy of study in their own right. But these letters also left me wondering what happened to Annie Phillips, who must now be nearing 80. Did she ever go to university? Is she still reading Wilfred Owen?

Item 151 in MS. 12282 photogr. 3 is a postcard of Scarborough during the war, collected as part of a group of postcards of places connected to Wilfred Owen. It follows postcards of Bordeaux, Ripon, Ors, and many other places. The photographed place is the focus of these postcards, and very few have any writing on them. But item 151 dates from the First World War and has a message written to a ‘Miss Lucy Sunderland’ from ‘Daddy’. Archival work is never neutral, and the decision made to use this postcard in the collection represents a value judgement: the photographic record of a place is of greater importance than the message contained on the verso of the card. In the catalogue, I decided to include the information about the scribbled message in an attempt to balance out the conflicting demands placed upon this item. We’ll never know if Lucy’s Daddy made it back home again.

Item 16 in MS. 12282 objects 2 is a tiny cardboard box inside Susan Owen’s jewellery box. This tiny box contains two envelopes with the hair of Wilfred Owen inside. One of the locks of hair even had the shed skin of a carpet beetle lodged within it! The hair itself was one of the most moving discoveries within the collection, with a tangibility that is both enticing and repulsive. But the manner of preservation was fascinating, too. The hair had originally been labelled in the envelopes and box by someone with a cursive hand, most likely Susan Owen herself, who would have been the one to cut Wilfred’s hair. The pencil marks had somewhat faded away, but one of the envelopes read ‘The hair of Sir Wilfred Edward Salter-Owen at the age of 11 ½ months in the year 1894’. For Susan, then, this was the act of a proud mother, keeping a memory of her son’s early years, to look back upon when he was older. But the cursive pencil writing is overshadowed by the characteristic small capitals in ink of Harold Owen. Harold labels the box as ‘The poet Wilfred Owen’s hair’. He displays an entirely different motive – to preserve the remains of a well-known literary figure. The object’s purpose and identity have been altered by the motives of its various owners. How the Bodleian labels this item will necessarily be another act of alteration. A strand of hair is never just a strand of hair!

Laura Hackett

Oxford LibGuides: Web Archives

Web archives are becoming more and more prevalent and are being increasingly used for research purposes. They are fundamental to the preservation of our cultural heritage in the interconnected digital age. With the continuing collection development on the Bodleian Libraries Web Archive and the recent launch of the new UK Web Archive site, the web archiving team at the Bodleian have produced a new guide to web archives. The new Web Archives LibGuide includes useful information for anyone wanting to learn more about web archives.

It focuses on the following areas:

  • The Internet Archive, The UK Web Archive and the Bodleian Libraries Web Archive.
  • Other web archives.
  • Web archive use cases.
  • Web archive citation information.

Check out the new look for the Web Archives LibGuide.

New Catalogue: Papers of Louis MacNeice

The catalogue of the papers of the Northern Irish poet and playwright Louis MacNeice (1907-1963) is now available online.

MacNeice studied Classics at Oxford from 1926, and together with Stephen Spender and Cecil Day-Lewis, he became part of the circle of poets and writers that had formed around W.H. Auden. His professional life began in 1930 as a lecturer in Classics, but in 1941 he joined the BBC and for the next twenty years produced radio plays and other programmes for the Features Department.

Whilst he also wrote articles and reviews, theatre plays, a novel and even a children’s book, MacNeice is best known for his poetry. Between 1929 and 1963, he published more than a dozen poetry volumes, such as Autumn Journal (1939), regarded by many as his masterpiece, Springboard (1944), Holes in the Sky (1948), Ten Burnt Offerings (1952), and Visitations (1957). His last poetry volume, The Burning Perch, came out just a few days after MacNeice’s untimely death in autumn 1963.

Amongst other works published posthumously were a book entitled Astrology (1964), Selected Poems (1964) edited by W.H. Auden, the autobiography The Strings are False (1965) edited by E.R. Dodds, and Varieties of Parable (1965), as well as the radio/theatre plays The Mad Islands and The Administrator (1964), One for the Grave (1968) and Persons from Porlock (1969), and the song cycle The Revenant (1975).

(Frederick) Louis MacNeice by Howard Coster, nitrate negative, 1942. NPG x1624. © National Portrait Gallery, London. (CC BY-NC-ND 3.0)

The archive at the Bodleian Libraries comprises more than 70 boxes of literary papers and other material relating to Louis MacNeice’s career as a writer, as well as extensive personal and professional correspondence, and some personal papers.

Introducing the new UK Web Archive website

Until recently, if you wanted to search the vast UK Legal Deposit Web Archive (containing the whole UK Web space), then you would need to travel to the reading room of a UK Legal Deposit Library to see if what you needed was there. For the first time, the new UK Web Archive website offers:

  • The ability to search the Legal Deposit web archive from anywhere.
  • The ability to search the Legal Deposit web archive alongside the ‘Open’ UK Web Archive (15,000 or so publicly available websites collected since 2005).
  • The opportunity to browse over 100 curated collections on a wide range of topics.

Who is the UK Web Archive?
UKWA is a partnership of all the UK Legal Deposit Libraries – The British Library, National Library of Scotland, National Library of Wales, the Bodleian Libraries, Cambridge University Libraries, and Trinity College, Dublin. The Legal Deposit Web Archive is available in the reading rooms of all the Libraries.

How much is available now?
At the time of writing, everything that a human (curators and collaborators) has selected since 2005 is searchable. This constitutes many thousands of websites and millions of individual web pages. The huge yearly Legal Deposit domain crawls will be added over the coming year.

This includes over 100 curated collections of websites on a wide range of topics and themes. Recent collections curated by the Bodleian Libraries include:

Do the websites look and work as they did originally?
Yes and no. Every effort is made so that websites look how they did originally and internal links should work. However, for a variety of technical reasons many websites will look different or some elements may be missing. As a minimum, all of the text in the collection is searchable and most images should be there. Whilst we collect a considerable amount of video, much of this will not play back.

Is every UK website available?
We aim to collect every website made or owned by a UK resident; in reality, however, it is extremely difficult to be comprehensive! Our annual Legal Deposit collections include every .uk (and .london, .scot, .wales and .cymru) website plus any website on a server located in the UK. Of course, many websites are .com, .info etc. and on servers in other countries.
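As a rough illustration of the domain rule above, the following Python sketch checks whether a URL's hostname ends in one of the top-level domains listed. It is not the UK Web Archive's actual crawler logic: the function name `in_scope_by_tld` is hypothetical, and the second criterion (a server located in the UK) is not modelled here because it would need a separate geolocation lookup.

```python
from urllib.parse import urlsplit

# Top-level domains named in the paragraph above.
UK_TLDS = (".uk", ".london", ".scot", ".wales", ".cymru")


def in_scope_by_tld(url: str) -> bool:
    """Return True if the URL's hostname ends in one of the listed TLDs.

    Hypothetical helper for illustration only; server location is not checked.
    """
    # urlsplit only finds a hostname when the URL includes a scheme (http://, https://).
    host = urlsplit(url).hostname or ""
    # str.endswith accepts a tuple and matches if any suffix fits.
    return host.rstrip(".").endswith(UK_TLDS)
```

A URL such as `https://example.com/` would fail this check even if it belonged to a UK resident, which is exactly why sites like that need to be nominated by hand.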

If you have or know of a UK website that should be in the archive, we encourage you to nominate it via the website.

Another version of this post was first published on the UK Web Archive blog.

New Conservative Party Archive releases for 2019

Speaking notes prepared for Margaret Thatcher, annotated drafts of William Hague’s election leaflets, and briefing papers written by David Cameron as a young researcher are all among files newly released by the Conservative Party Archive for 2019. This year, our releases are drawn primarily from the records of the Conservative Research Department (CRD): these comprise the department’s subject files and working papers, its briefings prepared for Members of Parliament, and the papers and correspondence of CRD desk officers. In addition to our regular scheduled de-restrictions, the Conservative Party Archive is pleased to announce that the papers of Robin Harris, the Director of the Conservative Research Department from 1985 to 1989, will also be made available for consultation for the first time. This blog will briefly look at some of the items to be found in each of these main series, demonstrating the value of these collections to researchers of the Conservative Party and historians of modern Britain.

Conservative Research Department Files, 1988

Among the newly-released records are a number of files on the ever-thorny question of Europe, including the minutes and papers of the European Steering Committee, the Party’s coordinating group for the 1989 elections to the European Parliament. These files provide a fascinating insight into the challenges the Party faced in trying to balance the record of its MEPs with the increasing Euroscepticism of British Conservatism: a September 1988 report on the Party’s private polling on Europe, for instance, warned that nearly a third of Conservative general election voters were opposed to EEC membership and would not turn out to support the Party in the European Elections [CPA CRD 4/30/3/1]. The Conservative Party Archive has, separately, also recently acquired the records of the Conservative delegation to the European Parliament in this period, and will be seeking to make these available for consultation later in 2019.

Minutes and papers of the European Steering Committee – CPA CRD 4/30/3/1.

Conservative Research Department Briefings, 1988

This year’s releases under the thirty-year rule include a wide range of policy briefings prepared by the Research Department. These briefings, typically prepared for Conservative MPs and Peers ahead of parliamentary debates, provide an excellent snapshot of the Party’s thinking, tactics, and rhetorical strategy on the key issues of the day. Subjects covered by the briefings include some of the most prominent policies of the Thatcher government, including the introduction of the Community Charge (Poll Tax) and the privatisation of state-owned utilities.

A selection of CRD briefings from the Environment and Local Government file, covering the Community Charge, Section 28, and Acid Rain – CPA CRD/B/11/7.

This series notably includes briefing papers prepared by David Cameron during his time in CRD, covering topics on environmental, energy and industrial policy. In 1989 Cameron became the Head of the Political Section, a post he held in the department until 1992, and we expect to be able to de-restrict more of his papers from this period in the years ahead.

Two CRD briefings on Energy Privatisation written by David Cameron – CPA CRD/B/10/8.

Conservative Research Department Letter Books, 1988

The papers and letter books of the Research Department desk officers are a unique resource for those studying the history of Conservatism. Among those files newly de-restricted for 2019 are the letter books of CRD Desk Officer Richard Marsh. Specialising in environmental policy and local government, Marsh’s papers include extensive material on the Poll Tax, and are likely to be of high value to researchers of the subject. Marsh’s papers also include a draft copy of William Hague’s election leaflet from the 1989 by-election, complete with revealing annotations – a pledge to bring in harsher sentences for criminals, for instance, is struck out and replaced with a vaguer commitment to take ‘vigorous action in the fight against crime’ [CPA CRD/L/4/40/2].

Annotated drafts of an election leaflet for William Hague, the Party’s candidate in the 1989 Richmond By-election – CPA CRD/L/4/40/2.

Papers of Robin Harris, Research Department Director, 1985-1988

Finally, the records of CRD Director Robin Harris provide a rich insight into the Conservative Party during the 1980s. For instance, Harris’ letter book for August and September 1987 shows how the Research Department went about preparing material for Thatcher’s speech to the Conservative Party Conference, with draft sections of the speech and working memoranda included in the file [CRD/D/10/2/25].

Robin Harris file on Margaret Thatcher’s 1987 Party Conference speech, including draft speech sections – CPA CRD/D/10/2/25.

Harris’ papers also show how the Party responded at times of political crisis. During the Westland Affair, when Thatcher’s premiership was briefly seen to be threatened, the Party received numerous letters from the public calling on the Prime Minister to resign. Harris’ memo books from the time show how Conservative Central Office managed the situation, drafting template responses defending the government’s conduct [CRD/D/10/1/11]. The papers should prove to be a valuable resource for historians of the period, and we expect to be able to make further de-restrictions in this series under the thirty-year rule in January 2020.

Robin Harris memoranda on the Party’s response to the Westland Affair – CPA CRD/D/10/1/11.

All the material featured in this blog post will be made available from 1 Jan 2019. The full list of de-restricted items will be published shortly on the CPA website, where de-restriction lists from previous years are also available.

Archives Unleashed – Vancouver Datathon

On the 1st-2nd of November 2018 I was lucky enough to attend the Archives Unleashed Datathon Vancouver, co-hosted by the Archives Unleashed Team and Simon Fraser University Library along with KEY (SFU Big Data Initiative). I am very grateful for the generous travel grant from the Andrew W. Mellon Foundation that made this possible.

The SFU campus at the Harbour Centre was an amazing venue for the Datathon and it was nice to be able to take in some views of the surrounding mountains.

About the Archives Unleashed Project

The Archives Unleashed Project is a three-year project with a focus on making historical internet content easily accessible to scholars and researchers whose interests lie in exploring and researching both the recent past and contemporary history.

After a series of datathons held at a number of international institutions such as the British Library, University of Toronto, Library of Congress and the Internet Archive, the Archives Unleashed Team identified some key areas of development that would help to deliver their aim of making petabytes of valuable web content accessible.

Key Areas of Development
  • Better analytics tools
  • Community infrastructure
  • Accessible web archival interfaces

By engaging and building a community, alongside developing web archive search and data analysis tools, the project is successfully enabling a wide range of people including scholars, programmers, archivists and librarians to “access, share and investigate recent history since the early days of the World Wide Web.”

The project has a three-pronged approach
  1. Build a software toolkit (Archives Unleashed Toolkit)
  2. Deploy the toolkit in a cloud-based environment (Archives Unleashed Cloud)
  3. Build a cohesive user community that is sustainable and inclusive by bringing together the project team members with archivists, librarians and researchers (Datathons)

Archives Unleashed Toolkit

The Archives Unleashed Toolkit (AUT) is an open-source platform for analysing web archives with Apache Spark. I was really impressed by AUT due to its scalability, relative ease of use and the huge number of analytical options it provides. It can work on a laptop (macOS, Linux or Windows), a powerful cluster or a single-node server and, if you wanted to, you could even use a Raspberry Pi to run AUT. The Toolkit allows for a number of search functions across the entirety of a web archive collection. You can filter collections by domain, URL pattern, date, language and more; create lists of URLs and return the top ten in a collection; and extract plain text from the HTML files in an ARC or WARC file, cleaning the data by removing ‘boilerplate’ content such as advertisements. It’s also possible to use the Stanford Named Entity Recognizer (NER) to extract names of entities, locations, organisations and persons. I’m looking forward to seeing how this functionality is adapted to localised instances and controlled vocabularies. Would it be possible to run a similar programme for automated tagging of web archive collections in the future? Maybe ingest a collection into AUT, run NER and automatically tag the data, providing richer metadata for web archives and subsequent research.
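
To give a feel for the ‘top ten’ idea, here is a minimal sketch in plain Python. It is not AUT’s own API (AUT runs on Apache Spark); the function name top_domains and the sample URLs are hypothetical and only illustrate counting the most frequent domains in a list of URLs.

```python
from collections import Counter
from urllib.parse import urlsplit


def top_domains(urls, n=10):
    """Return the n most frequent host names in an iterable of URLs."""
    hosts = []
    for url in urls:
        host = urlsplit(url).hostname  # None when the URL has no network location
        if host:
            # Treat "www.example.org" and "example.org" as the same site.
            if host.startswith("www."):
                host = host[4:]
            hosts.append(host)
    return Counter(hosts).most_common(n)


# Hypothetical sample data, not taken from any real collection.
sample_urls = [
    "http://www.example.org/letters/1",
    "http://example.org/letters/2",
    "https://example.com/about",
]
print(top_domains(sample_urls))
```

On a real collection the same counting would be done by the Toolkit across the whole archive rather than over a small in-memory list.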

Archives Unleashed Cloud

The Archives Unleashed Cloud (AUK) is a GUI-based front end for working with AUT; it essentially provides an accessible interface for generating research derivatives from web archive files (WARCs). With a few clicks users can ingest and sync Archive-It collections, analyse the collections, create network graphs and visualise connections and nodes. It is currently free to use and runs on AUK central servers.

My experience at the Vancouver Datathon

The datathons bring together a small group of 15-20 people of varied professional backgrounds and experience to work and experiment with the Archives Unleashed Toolkit and the Archives Unleashed Cloud. I really like that the team have chosen to limit the numbers that attend because it created a close-knit working group that was full of collaboration and the exchange of knowledge and ideas. It was a relaxed, fun and friendly environment to work in.

Day One

After a quick coffee and light breakfast, the Datathon opened with introductory talks from project team members Ian Milligan (Principal Investigator), Nick Ruest (Co-Principal Investigator) and Samantha Fritz (Project Manager), relating to the project – its goals and outcomes, the toolkit, available datasets and event logistics.

Another quick coffee break and it was back to work – participants were asked to think about the datasets that interested them, techniques they might want to use and questions or themes they would like to explore and write these on sticky notes.

Once placed on the white board, teams naturally formed around datasets, themes and questions. The team I was in consisted of Kathleen Reed and Ben O’Brien and formed around a common interest in exploring the First Nations and Indigenous communities dataset.

Virtual machines were kindly provided by Compute Canada and were available throughout the Datathon to run AUT; datasets were preloaded onto these VMs and a number of derivative files had already been created. We spent some time brainstorming, sharing ideas and exploring datasets using a number of different tools. The day finished with some informative lightning talks about the work participants had been doing with web archives at their home institutions.

Day Two

On day two we continued to explore datasets by using the full text derivatives, running some NER and performing key word searches using the command line tool grep. We also analysed the text using sentiment analysis with the Natural Language Toolkit. To help visualise the data, we took the new text files produced from the key word searches and uploaded them into Voyant Tools. This helped by visualising links between words, creating a list of top terms and providing quantitative data such as how many times each word appears. It was here we found that the word ‘letter’ appeared quite frequently and we finalised the dataset we would be using – University of British Columbia – bc-hydro-site-c.
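
The key word search and top terms steps can be approximated in a few lines of Python. This is only a rough sketch, not the commands we ran: the function names keyword_lines and top_terms and the sample lines are hypothetical, and the tokenising is far simpler than what Voyant Tools does.

```python
import re
from collections import Counter


def keyword_lines(lines, keyword):
    """Return the lines containing keyword, ignoring case (similar to grep -i)."""
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    return [line for line in lines if pattern.search(line)]


def top_terms(lines, n=5):
    """Count lower-cased word tokens and return the n most common."""
    words = []
    for line in lines:
        # Simple tokeniser: runs of ASCII letters and apostrophes only.
        words.extend(re.findall(r"[a-z']+", line.lower()))
    return Counter(words).most_common(n)


# Hypothetical stand-in for lines of a full-text derivative.
sample = [
    "Letter to the panel about the dam",
    "Site news and updates",
    "Another letter about the river",
]
matches = keyword_lines(sample, "letter")
print(matches)
print(top_terms(matches, n=3))
```

A real analysis would usually also drop common stop words such as ‘the’ before counting, otherwise they dominate the list of top terms.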

We hunted down the site and found it contained a number of letters from people about the BC Hydro Dam Project. The problem was that the letters were in a table and when extracted the data was not clean enough. Ben O’Brien came up with a clever extraction solution utilising the raw HTML files and some script magic. The data was then prepped for geocoding by Kathleen Reed to show the geographical spread of the letter writers, hot-spots and timeline, a useful way of looking at the issue from the perspective of engagement and the community.
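
For readers curious about what pulling data out of an HTML table can look like, here is a minimal sketch using only Python’s standard library. It is not Ben’s actual script, which is not reproduced in this post; the class name TableRows and the sample markup are hypothetical.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text fragments of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse runs of whitespace left over from the markup.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # discard any cell left unclosed


# Hypothetical markup; the real page's structure is not described in the post.
sample_html = """
<table>
  <tr><th>Name</th><th>Location</th></tr>
  <tr><td>A. Writer</td><td>Exampleville,
      BC</td></tr>
</table>
"""
parser = TableRows()
parser.feed(sample_html)
print(parser.rows)
```

Each row comes back as a list of cleaned cell strings, which is the kind of shape a geocoding step needs as input.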

Map of letter writers.

Time Lapse of locations of letter writers. 

At the end of day two each team had a chance to present their project to the other teams. You can view the presentation (Exploring Letters of protest for the BC Hydro Dam Site C) we prepared here, as well as the other team projects.

Why Web Archives Matter

How we preserve, collect, share and exchange cultural information has changed dramatically. The act of remembering at national institutions and libraries has altered greatly in terms of scope, speed and scale due to the web. The way in which we provide access to, use and engage with archival material has been disrupted. All current and future historians who want to study the periods after the 1990s will have to use web archives as a resource. Currently, accessibility and usability have lagged behind and many students and historians are not ready. Projects like Archives Unleashed will help to equip researchers, historians, students and the community with the necessary tools to combat these problems. I look forward to seeing the next steps the project takes.

Archives Unleashed are currently accepting submissions for the next Datathon in March 2019; I highly recommend it.