The Virtual Studio

I am in Venice to present a paper with two colleagues from the School of Architecture, at a two-day conference organised by the Metadata for Architectural Materials in Europe (MACE) Project. Yesterday was a significant day, for reasons I want to detail below. Skip to the end of this long post, if you just want to know the outcome and why this conference has been an important and positive turning point in the Virtual Studio project.

I joined the university just over a year ago to work on the JISC-funded LIROLEM Project:

The Project aimed to lay the groundwork for the establishment of an Institutional Repository that supports a wide variety of non-textual materials, e.g. digital animations of 3-D models, architectural documentation such as technical briefings and photographs, as well as supporting text based materials. The project arose out of the coincidental demands for the University to develop a repository of its research outputs, and a specific project in the school of Architecture to develop a “Virtual Studio”, a web based teaching resource for the school of Architecture.

At the end of the JISC-funded period, I wrote a lengthy summary on the project blog, offering a personal overview of our achievements and challenges during the course of the project. Notably, I wrote:

The LIROLEM Project was tied to a Teaching Fellowship application by two members of staff in the School of Architecture. Their intentions were, and still are, to develop a Virtual Studio which compliments the physical design Studio. Although the repository/archive functionality is central to the requirements of the Virtual Studio, rather than being the primary focus of the Studio, a ‘designerly’, dynamic user interface that encourages participation and collaboration is really key to the success of the Studio as a place for critical thinking and working. In effect, the actual repository should be invisible to the Architect who has little interest, patience or time for the publishing workflow that EPrints requires. More often that not, the Architects were talking about wiki-like functionality, that allowed people to rapidly generate new Studio spaces, invite collaboration, bring in multimedia objects such as plans, images and models, offer comment, discussion and critique. As student projects developed in the Virtual Studio, finished products could be archived and showcased inviting another round of comment, critique and possibly derivative works from a wider community outside the classroom Studio.

Our conference paper discussed the difficulties of ensuring that the (minority) interests of the Architecture staff were met while trying to gain widespread institutional support and sustainability for the Institutional Repository which the LIROLEM project aimed, and had an obligation, to achieve. During the presentation (below), we asked:

Can academics and students working in different disciplines be easily accommodated within the same archival space?

Our presentation slides. My bicycle is a reference to Bijker (1997)

The paper argues that advances in technology result from complex and often conflicting social interests. Within the context of the LIROLEM Project, it was the wider interests of the Institution which took precedence, rather than the minority interests of the Architectural staff.  I’m not directing criticism towards decisions made during the project; after all, I made many of them so as to ensure the long-term sustainability of the repository, but yesterday we argued that

architecture is an atypical discipline; its emphasis is more visual than literary, more practice than research-based and its approach to teaching and learning is more fluid and varied than either the sciences or the humanities (Stevens, 1998). If we accept that it is social interests that underlie the development of technology rather than any inevitable or rational progress (Bijker, 1997), the question arises as to what extent an institutional repository can reconcile architectural interests with the interests of other disciplines. Architecture and the design disciplines are marginal actors in the debate surrounding digital archive development, this paper argues, and they bring problems to the table that are not easily resolved given available software and that lie outside the interests of most other actors in academia.

Prior to the conference, I was unsure of what to do next about the Virtual Studio. I felt that the repository was the wrong application for supporting a collaborative studio environment for architects. Central to this was the unappealing deposit and cataloguing workflow in the IR and the general aesthetic of the user interface which, despite some customisation, does not appeal to designers’ expectations of a visual tool for the deposit and discovery of architectural materials.

However, the MACE Project appears to have just come to our rescue with the development of tools that query OAI-PMH data mapped to their LOM profile, enriches the harvested metadata (by using external services such as Google Maps and collecting user generated tags, for example) and provides a social platform for searching participating repositories. I managed to ask several questions throughout the day to clarify how the anticipated architectural content in our repository could be exposed to MACE.  My main concern was our issue of having a general purpose Institutional Repository, but wanting to handle subject-specific (architecture) content in a unique way. I was told that the OAI-PMH has a ‘set‘ attribute which could be used to isolate the architectural content in the IR for harvesting by MACE. Another question related to the building of defined communities or groups within the larger MACE community (i.e. students on a specific course) and was told that this is a feature they intend to implement.

Because of the work of MACE, the development of a search interface and ‘studio’ community platform has largely been done for us (at least to the level of expectation we ever had for the project). Ironically, we came to the conference questioning the use of the IR as the repository for the Virtual Studio, but now believe that we may benefit from the interoperability of the IR, despite suffering some of its other less appealing attributes. One of the things that remains for us to do, is improve the deposit experience to ensure we collect content that can be exposed to the MACE platform.

For this, I hope we can develop a SWORD tool that simplifies the deposit process for staff and students, reducing the work flow process down to the two or three brief steps you find on Flickr or YouTube, repositories they are likely to be familar with and judge others against. User profile data could be collected from their LDAP login information and they would be asked to title, describe and tag their work. A default BY-NC-ND Creative Commons license would be chosen for them, which they could opt out of (but consequently also opt out of MACE harvesting, too).

Boris Müller, who works on the MACE project, spoke yesterday of the “joy of interacting with [software] interfaces.” This has clearly been a central concern of the MACE project as it has been for the Virtual Studio project, too. I’m looking forward to developing a simple but appealing interface that can bring at least a little joy to my architect colleagues and their students.

MACE Conference

On Friday, Andy, Carl and I are going to the Metadata for Architectural Contents in Europe (MACE) Conference in Venice, to present a paper which reflects on the issues raised during our JISC-funded LIROLEM Project. Here’s a Word Cloud of the paper, for those of you who don’t have the time or inclination to read it. For those of you who do, it’s in our repository, of course.

 

Image created at wordle.net

EPrints Session and OR08 Reflections

Back in the office, following a week away at the Open Repositories conference.

The last couple of days were spent in EPrints sessions, as that is the repository software we use here at Lincoln. I found the first session most interesting as the new features in EPrints 3.1 were discussed. The linked page explains in detail the changes in v3.1, but in summary they provide much more control for repository managers through a web interface, rather than editing config files directly. Les’ slides give a nice overview.

The following session on EPrints and the RAE generally reflected the experience we’ve had using EPrints 2 for the RAE last year.

A session on repository analytics was a very useful overview of using Google Analytics, AWStats and IRStats to measure the various uses of an EPrints repository. Very useful, in particular IRStats which has been developed at Southampton for EPrints. I look forward to installing it.

The final sessions were mainly aimed at developers with a knowledge of Perl. I found the session on how to write plugins for EPrints 3 clear and interesting, but not especially useful as I don’t understand Perl. Still, it was obvious, even to me, that with a basic knowledge of programming, plugins could be written quite easily. I think it’s important for repository managers to immerse themselves in the technicalities of repository development even if they don’t understand much of the detail. Just by sharing ideas and questions with developers, you get a better understanding of what is involved in rolling out new features and a sense of what can be achieved within given resources.

On the whole, the conference leaned towards the technical rather than the strategic and managerial aspects of institutional repositories. There were a lot of developers present and the number of technical projects discussed seemed high. Personally, I appreciated this and came away with a good sense of where the development of repositories is going. It would have been good to have had an event which explicitly aimed at bringing both developers and repository staff together.

Finally, I do wonder whether the open access repository community would benefit from engaging with developments in Enterprise Content Management, as there is a great deal of overlap, having to face similar issues around workflow, IPR and technical standards. Perhaps there are universities evaluating the open source Alfresco ECMS as a repository platform. If so, I’d like to hear about them.

Next year, the conference is in Atlanta, USA.

Session 7: Usage

This part of the conference ended with two excellent and very different presentations on measuring the usage and impact of scholarly output.

Tim Brody, from the University of Southampton, discussed his work developing IRStats, a tool to measure the use and impact of open access repositories. IRStats has been developed to answers questions such as, “What is Professor Smith’s most downloaded paper?” and “Who is the most highly downloaded author in Mathematics?” Existing tools such as Google Analytics and AWStats, don’t offer this level of detail, which can be useful for both strategically placing the repository as an important tool in the University and as a service to both individual scholars and departments. IRStats is available for EPrints and I intend to try it in our repository.

The final presentation was by Johan Bollen, from the Los Alamos National Laboratory. He took off from where Tim left us and discussed a much larger scale project called MESUR. This project also attempts to measure the impact of scholarly output by analysing metrics from usage data. It differs to the IRStats project in both its methodology and scale, combining the evaluation of usage, citation and bibliographic data. By analysing this data, they’ve produced some fascinating graphs which show the relationships between academic disciplines. This is a project I look forward to learning more about.

As I mentioned, this was the last session in this part of the conference. The next day-and-a-half, I will be attending an EPrints User Group Session, where I hope to learn more about the new version of EPrints, the experience people had of the RAE excercise and repository analytics. There’s also a couple of training and support sessions which will be useful.

Session 5: Legal

Grace Agnew, from Rutgers Universities Library, presented over 40 PPT slides on Digital Rights Management. Her book is due to be published later this year. It’s still not clear to me why we need DRM in open access repositories. Surely this conference is an opportunity to promote the benefits of Copyleft. A simple way of managing the rights to academic research, which costs nothing, is to attach a Creative Commons license to the work. It’s what software developers, with the similar GPL license have been doing for 20 years with great reward. Of course, this ignores the issue that for most academics, the IPR ultimately belongs to the University and it’s at this level that the discussion needs to be had. An academic who deposits their work in a repository and chooses to license their work under a Creative Commons license, may be forgetting that they do not own the work in the first place. In practice, academics are usually free to publish their work as they wish, but the explicit application of a Copyleft license is, unfortunately, not a guaranteed right.

Next, Brian Fitzgerald, from QUT Law School, discussed the OAK Law Project. The project looks fascinating and notably links to the Creative Commons initiative in Australia. It’s a shame that Brian didn’t talk about that and its relevance to open access repositories.

Finally, Jenny Brace, from the Version Identification Framework Project, presented the results of their project. By this point, the microphone in the auditorium had stopped working and I couldn’t hear very much, which was a shame, as it’s an important and interesting area of study, something which I’ve had to deal with ever since working in Collections Management at the NFTVA, where the correct ‘versioning’ of TV and Film materials was a constant issue. ‘Version’ means different things to different groups of people.

Session 4: National & International Perspectives

Arjan Hogenaar & Wilko Steinhoff, from KNAW, gave a presentation on AID, a Dutch Academic Information Domain. I’ll be honest and admit I didn’t pay much attention to this as I was writing up my blog notes for Session 3. Follow the hyperlinks for more information.

I was able to concentrate on the next two presentations which were both interesting and relevant to our work at Lincoln. The first was by Chris Awre, from the University of Hull, who is working on the EThOS project, a joint project between several HE institutions and the BL. It’s a project to provide a central repository service for e-theses produced in the UK. The idea is that the BL will harvest e-thesis specific UK ETD metadata provided by University repositories to create a single point of access to this type of academic output. Interestingly, the business model for this is a subscription service, whereby universities are expected to pay for the harvesting of metadata and digitisation of hard copy theses when they are requested. The content is Open Access (search, download), financially supported by a paid-for harvesting and digitisation service. It’s always interesting to see how people are creating new business models based on freely giving a product away. I hope it’s a success.

The third presentation was by Vanessa Proudman, from Tilburg University and the DRIVER Project. This was excellent, not least because of the rare clarity of presentation but also because the research findings are directly relevant and useful to us at Lincoln as we embark on establishing a repository service in the University. Vanessa looked at the challenges we face in populating our repositories and suggested key methods of increasing the number of deposits, noting that even with a Mandate, the deposit rate is only 40-60%. This work is published as part of a new book (chapter 3), which, naturally, can be downloaded here. Upon return to work, I intend to look at this in detail and begin drafting a plan for the next phase of our repository project, which is to establish an Open Access Mandate at the University and begin the important advocacy work within the Faculties.

Session 3: Interoperability

The final three presentations of the day focussed on interoperability. The first two, specifically discussed ways to make it easier for users to deposit materials into repositories. Julie Allinson, from the SWORD Project, discussed the work they have done and the use of the Atom Publishing Protocol as a framework for developing a derivative SWORD deposit profile. The presentation finished by noting that a NISO standard and tools are being developed for this same purpose and it is hoped that they take into consideration the work done by the SWORD project.

Scott Yeadon, from the Australian National University, gave a presentation on tools which the RIFF Project have developed for DSpace and Fedora to facilitate easier deposit of content into these repositories. Their work took real world examples of content to deposit and developed a submission service, a METS content packaging profile and dissemination service.

Both the SWORD and RIFF Projects demonstrated working examples of their services, albeit in early form. The main question remaining is whether they will be adopted beyond the confines of the project. Part of project work is research and development, but a significant part is also the marketing of the results of the project, for which OR2008 is clearly an important venue.

Finally, Dean Krafft, from Cornell University, presented NCore, a wide range of open source tools for creating digital repositories. Much bigger in scale than the previous two projects, the NCore platform is notable for being released on Sourceforge as a community project. It also has guaranteed funding until 2012, suggesting that even greater work is to come. It’s basically a suite of software tools and services built around the Fedora repository, developed to manage millions of objects, initially at the National Science Digital Library (NSDL). It was an excellent presentation of what appears to be a successful project and set of products. Building a Fedora repository requires a higher investment of resources than installing DSpace or EPrints and projects which use this platform, although often complex and difficult, tend to produce very interesting and impressive results.