PledgePolicyPrototype
From DSpace Wiki
Policy Storage Prototype
This page describes the Policy Repository that was created for the PLEDGE Project, an add-on-style extension to DSpace 1.5.
Introduction
What do we mean by Policy?
In this context, Policy refers to the general definition of the
word rather than the specific meaning it has acquired in the authorization
area of the DSpace data model (e.g. around the ResourcePolicy
and PolicySet classes).
On this page, a policy is typically either a rule describing (or prescribing) the
interactions of actions that take place within the archive, or a constraint
determining when and by whom an action may be taken.
For example, a policy could require that every Item being submitted include an approved deposit license. Another policy might require that every Bitstream in the asset store be checked for content integrity (i.e. its checksum recomputed and compared with the checksum on record) at least once every six months.
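To make the second example concrete, here is a minimal sketch of the two mechanical questions such a policy raises: is a check overdue, and does the recomputed checksum match the one on record? It is illustrative only and not part of the prototype, which stores policies but does not enforce them. The class and method names are hypothetical, and the use of MD5 is an assumption made for the example.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.time.LocalDate;

// Hypothetical illustration of the integrity-check policy; not prototype code.
public class IntegrityPolicySketch {

    // The policy's interval: a check at least once every six months.
    static final int MAX_MONTHS_BETWEEN_CHECKS = 6;

    // True when six months or more have passed since the last check.
    public static boolean isCheckDue(LocalDate lastChecked, LocalDate today) {
        return !lastChecked.plusMonths(MAX_MONTHS_BETWEEN_CHECKS).isAfter(today);
    }

    // Recomputes the checksum of the content as lowercase hex.
    // MD5 is an assumption here; substitute the algorithm your archive records.
    public static String md5Hex(byte[] content) throws NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("MD5");
        StringBuilder hex = new StringBuilder();
        for (byte b : md.digest(content)) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    // Compares the recomputed checksum with the checksum on record.
    public static boolean isIntact(byte[] content, String checksumOnRecord)
            throws NoSuchAlgorithmException {
        return md5Hex(content).equalsIgnoreCase(checksumOnRecord);
    }
}
```

A policy engine would call isCheckDue to decide when to act and isIntact to decide whether the action succeeded. The policy itself, as stored in the repository, only states the rule.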
A Policy Repository
Since one of the goals of the PLEDGE project was to develop a machine-readable encoding of policies, we needed a place to put them. Policies may be associated with all types of data model objects: Items, Collections, Communities, etc. It would have required database changes to manage this within the existing object model.
However, the data we have to store is in RDF, and the
History System Prototype includes an
RDF triple-store that can associate RDF statements with instances of DSpace
objects, all outside of the DSpace data model.
The RDFRepository class from the History system
is easily subclassed to serve as a policy store as well.
Implementation Details
Policies are written in the Rei language, which is based on RDF and OWL. A policy is thus a collection of RDF statements. See the Rei examples in these samples of policy expressions.
Binding of Policies to DSpace Objects
The PolicyRepository class lets you store RDF statements
keyed to a DSpace object, so they can be retrieved later in the context
of the object. An object thus accumulates policies bound to it.
The policy information model assumes that the policies of an object also apply (when relevant) to objects below it in the "ownership" hierarchy. For example, a policy dictating replication terms at the Community level would also apply to each Collection and Subcommunity under that Community, and to the Items, etc., belonging to them.
In practice, this "inheritance" behavior would be implemented by a policy enforcement engine, but that has not even been designed yet. It is sufficient for the policy repository to retrieve the policies related to one DSpace object; a policy engine or other application can use the data model API to find other related objects (e.g. ancestors) and retrieve their policies.
As an example, the class PolicyStackStreamDisseminationCrosswalk
exports all of the policies belonging to an object and to its "stack"
of owner/ancestors -- its owner, its owner's owner, etc. on up to the Site.
This was done so that all possibly-relevant policies can be put into
a Dissemination Information Package (DIP) which is sent to a policy-aware
storage repository such as the SDSC's iRODS.
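The ancestor walk described above can be sketched in a few lines. This is a minimal illustration, not the prototype's code: the class name, the collectStack method and the two maps are hypothetical, Handles stand in for DSpace objects, and policies are plain strings rather than RDF statements.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: gather the policies bound to an object and to each of
// its owners, walking upward until an object with no owner (the Site) is reached.
public class PolicyStackSketch {

    // parentOf maps a Handle to its owner's Handle; policies maps a Handle to
    // the policies bound directly to that object. Results are nearest-first.
    // Assumes the ownership chain has no cycles.
    public static List<String> collectStack(String handle,
                                            Map<String, String> parentOf,
                                            Map<String, List<String>> policies) {
        List<String> stack = new ArrayList<>();
        for (String h = handle; h != null; h = parentOf.get(h)) {
            stack.addAll(policies.getOrDefault(h, Collections.<String>emptyList()));
        }
        return stack;
    }

    public static void main(String[] args) {
        // Item -> Collection -> Community -> Site
        Map<String, String> parentOf = new HashMap<>();
        parentOf.put("123456789/8", "123456789/7");
        parentOf.put("123456789/7", "123456789/3");
        parentOf.put("123456789/3", "123456789/0");

        Map<String, List<String>> policies = new HashMap<>();
        policies.put("123456789/8", Collections.singletonList("deposit-agreement"));
        policies.put("123456789/7", Collections.singletonList("replication"));
        policies.put("123456789/3", Collections.singletonList("public-availability"));
        policies.put("123456789/0", Collections.singletonList("required-metadata"));

        // Prints the Item's own policy followed by those of its ancestors.
        System.out.println(collectStack("123456789/8", parentOf, policies));
    }
}
```

Keeping the walk outside the store matches the division of labor described earlier: the repository only answers "what is bound to this object?", and the caller decides which related objects to ask about.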
Installation
Prerequisites
- Latest DSpace 1.5 development source
- Install History System Prototype, which requires, in order:
  - Install the Event System prototype
  - Install the AIP prototype patch
  - Install History System Prototype
Download and Install
- Download the sources to add and unzip them in your DSpace install (source) directory.
- Download the diffs to the dspace.cfg file and apply them to [dspace]/config/dspace.cfg with the patch utility. Note that you must update the config file used by the running DSpace instance.
1. Rebuild and install dspace.jar with the following command. (There is no need to rebuild the WARs, since the UIs never call the policy repository.)
ant install_code
2. Also, be sure the configuration changes are installed in the DSpace configuration file: [dspace]/config/dspace.cfg .
3. Finally, create the directory mentioned in the configuration as the value of policy.dir, e.g. [dspace]/policy . Be sure it is writable by the user who runs DSpace.
Usage
These examples assume the following contents in the archive, so substitute equivalent objects in your archive:
- 123456789/8 - an Item
- 123456789/7 - a Collection
- 123456789/3 - a Community
- 123456789/0 - the Site
Run the PolicyRepository command-line application with --help to learn about all of its options. It is the same as the HistoryRepository application in the History System Prototype.
dsrun edu.mit.pledge.PolicyRepository --help
Adding Policies to Objects
- Add Deposit Agreement policy to Item 123456789/8
  - dsrun edu.mit.pledge.PolicyRepository -s 123456789/8 cu0006rei.xml
- Add Replication policy to Collection 123456789/7
  - dsrun edu.mit.pledge.PolicyRepository -s 123456789/7 tu0011rei.xml
- Add Public Availability policy to Community 123456789/3
  - dsrun edu.mit.pledge.PolicyRepository -s 123456789/3 cu0008rei.xml
- Add required-metadata policy to the Site (123456789/0)
  - dsrun edu.mit.pledge.PolicyRepository -s 123456789/0 pp0004rei.xml
Examining Policies
This command will "disseminate" all of the policies associated with an object identified by Handle, in this case the Site:
dsrun edu.mit.pledge.PolicyRepository -d 123456789/0
Add the -f option to change the output format, e.g. to N3:
dsrun edu.mit.pledge.PolicyRepository -f n3 -d 123456789/0
Make an AIP that includes Policies
Be sure your DSpace Configuration includes a line like this:
aip.disseminate.techMD = PREMIS, AllPolicies:POLICY_STACK, ObjectPolicies:POLICY
The following command creates an AIP of the Item 123456789/8:
dsrun org.dspace.app.packager.Packager -d -t AIP -i 123456789/8 -e ADMIN-USER policy-aip.zip
You can download the sample AIP here.
Note that the mets.xml manifest includes the element:
<techMD ID="techMD_7">
  <mdRef LOCTYPE="URL" xlink:type="simple" xlink:href="metadata_69"
         MDTYPE="OTHER" OTHERMDTYPE="AllPolicies" MIMETYPE="text/xml"/>
</techMD>
This identifies metadata_69 as the whole "stack" of policies that applies to the Item. Another file in the AIP, metadata_67, contains just the policies that are actually bound to the Item itself.
Future Work
This is only an experimental prototype. The PolicyRepository implementation is crude, but it is adequate for getting policy metadata into the archive and into AIPs, so that it can be used in experiments with other, policy-aware repositories.
Next Steps
- Deploy and use this prototype to test it.
- Consider whether there is a need to modify a policy's statements as they are added to the repository, e.g. inserting the identifiers of specific objects and concrete values to replace variables or placeholders in the RDF.
- Editing or removing policies is not implemented. The History System's RDF repository is only concerned with adding statements, and has no mechanism to group the statements belonging to a policy (i.e. a subset of the statements bound to an object) so they can be removed or replaced as a group. (Perhaps use RDF reification to implement grouping.)
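To illustrate the grouping problem in the last item, here is a minimal sketch of a store that keys statements by object and by policy, so that one policy's statements can be replaced or removed together. It deliberately swaps in a simple nested map instead of the RDF reification suggested above. All names are hypothetical, and statements are plain strings rather than RDF triples.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of grouping statements per policy; not prototype code.
public class GroupedPolicyStoreSketch {

    // Handle -> (policy id -> that policy's statements)
    private final Map<String, Map<String, List<String>>> store = new LinkedHashMap<>();

    // Binds a policy to an object, replacing any earlier group with the same id.
    public void put(String handle, String policyId, List<String> statements) {
        store.computeIfAbsent(handle, h -> new LinkedHashMap<>())
             .put(policyId, new ArrayList<>(statements));
    }

    // Removes every statement of one policy; returns false if it was not bound.
    public boolean remove(String handle, String policyId) {
        Map<String, List<String>> byPolicy = store.get(handle);
        return byPolicy != null && byPolicy.remove(policyId) != null;
    }

    // Returns all statements bound to the object, across all of its policies.
    public List<String> statementsFor(String handle) {
        List<String> all = new ArrayList<>();
        Map<String, List<String>> byPolicy =
                store.getOrDefault(handle, Collections.<String, List<String>>emptyMap());
        for (List<String> group : byPolicy.values()) {
            all.addAll(group);
        }
        return all;
    }
}
```

The trade-off is that the grouping lives outside the RDF. A reification-based approach would keep it in the triple-store, at the cost of extra statements per policy.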
