Search Web Services - searchRetrieve Operation - Binding for SRU 1.2: Auxiliary Binding for HTTP GET- Version 1.0

Committee Draft 01

30 June 2008

Specification URIs:

This Version:

http://docs.oasis-open.org/search-ws/june08releases/binding-for-get-V1.0-cd-01.doc (Authoritative)

http://docs.oasis-open.org/search-ws/june08releases/binding-for-get-V1.0-cd-01.pdf  

http://docs.oasis-open.org/search-ws/june08releases/binding-for-get-V1.0-cd-01.html

Latest Version:

http://docs.oasis-open.org/search-ws/v1.0/binding-for-get-V1.0.doc

http://docs.oasis-open.org/search-ws/v1.0/binding-for-get-V1.0.pdf

http://docs.oasis-open.org/search-ws/v1.0/binding-for-get-V1.0.html

Technical Committee:

OASIS Search Web Services (SWS) TC  

Chair(s):

Ray Denenberg <rden@loc.gov>

Matthew Dovey <m.dovey@jisc.ac.uk>

Editor(s):

Ray Denenberg rden@loc.gov

Larry Dixson ldix@loc.gov

Matthew Dovey m.dovey@jisc.ac.uk

Janifer Gatenby Janifer.Gatenby@oclc.org

Ralph LeVan  levan@oclc.org

Ashley Sanders a.sanders@MANCHESTER.AC.UK

Rob Sanderson azaroth@liverpool.ac.uk

Related work:

This specification is related to:

·         Search Retrieve via URL (SRU)

Abstract:

This is a binding for HTTP GET.  It describes the construction of an http URL to encode parameter values of the form ‘key=value’.  Support for Unicode characters is described.

Status:

This document was last revised or approved by the OASIS Search Web Services (SWS) TC on the above date. The level of approval is also listed above. Check the “Latest Version” or “Latest Approved Version” location noted above for possible later revisions of this document.

Technical Committee members should send comments on this specification to the Technical Committee’s email list. Others should send comments to the Technical Committee by using the “Send A Comment” button on the Technical Committee’s web page at http://www.oasis-open.org/committees/search-ws

For information on whether any patents have been disclosed that may be essential to implementing this specification, and any offers of patent licensing terms, please refer to the Intellectual Property Rights section of the Technical Committee web page (http://www.oasis-open.org/committees/search-ws/ipr.php/ipr.php.

The non-normative errata page for this specification is located at http://www.oasis-open.org/committees/search-ws/.

Notices

Copyright © OASIS® 2007. All Rights Reserved.

All capitalized terms in the following text have the meanings assigned to them in the OASIS Intellectual Property Rights Policy (the "OASIS IPR Policy"). The full Policy may be found at the OASIS website.

This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published, and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this section are included on all such copies and derivative works. However, this document itself may not be modified in any way, including by removing the copyright notice or references to OASIS, except as needed for the purpose of developing any document or deliverable produced by an OASIS Technical Committee (in which case the rules applicable to copyrights, as set forth in the OASIS IPR Policy, must be followed) or as required to translate it into languages other than English.

The limited permissions granted above are perpetual and will not be revoked by OASIS or its successors or assigns.

This document and the information contained herein is provided on an "AS IS" basis and OASIS DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY OWNERSHIP RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

OASIS requests that any OASIS Party or any other party that believes it has patent claims that would necessarily be infringed by implementations of this OASIS Committee Specification or OASIS Standard, to notify OASIS TC Administrator and provide an indication of its willingness to grant patent licenses to such patent claims in a manner consistent with the IPR Mode of the OASIS Technical Committee that produced this specification.

OASIS invites any party to contact the OASIS TC Administrator if it is aware of a claim of ownership of any patent claims that would necessarily be infringed by implementations of this specification by a patent holder that is not willing to provide a license to such patent claims in a manner consistent with the IPR Mode of the OASIS Technical Committee that produced this specification. OASIS may include such claims on its website, but disclaims any obligation to do so.

OASIS takes no position regarding the validity or scope of any intellectual property or other rights that might be claimed to pertain to the implementation or use of the technology described in this document or the extent to which any license under such rights might or might not be available; neither does it represent that it has made any effort to identify any such rights. Information on OASIS' procedures with respect to rights in any document or deliverable produced by an OASIS Technical Committee can be found on the OASIS website. Copies of claims of rights made available for publication and any assurances of licenses to be made available, or the result of an attempt made to obtain a general license or permission for the use of such proprietary rights by implementers or users of this OASIS Committee Specification or OASIS Standard, can be obtained from the OASIS TC Administrator. OASIS makes no representation that any information or list of intellectual property rights will at any time be complete, or that any claims in such list are, in fact, Essential Claims.

The names "OASIS",  are trademarks of OASIS, the owner and developer of this specification, and should be used only to refer to the organization and its official outputs. OASIS welcomes reference to, and implementation and use of, specifications, while reserving the right to enforce its marks against misleading uses. Please see http://www.oasis-open.org/who/trademark.php for above guidance.

Table of Contents

1      Introduction. 5

1.1        Terminology. 5

1.2        Normative References. 5

2      Syntax. 6

3      Encoding (Client Procedure) 7

4      Decoding (Server Procedure) 8

5      Example. 9

A.     Acknowledgements. 10

 


1      Introduction

This is one of a suite of specifications for the OASIS SWS (Search Web Services) Standard.  It is an auxiliary binding, for use by other bindings within the suite.

This binding is for HTTP GET.  It describes the construction of an http URL to encode parameter values of the form ‘key=value’.  Support for Unicode characters is described.

1.1    Terminology

The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “NOT RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in [RFC2119]. When these words are not capitalized in this document, they are meant in their natural language sense.

1.2    Normative References

[RFC2119]               S. Bradner, Key words for use in RFCs to Indicate Requirement Levels, http://www.ietf.org/rfc/rfc2119.txt, IETF RFC 2119, March 1997.

2      Syntax

The client sends a request via the HTTP GET method. The request is a URI as described in RFC 3986. Specifically it is an HTTP URL of the form:

<base URL>?<searchpart>

using the standard &-separated key=value encoding for parameters in <searchpart>.

Example

Assume:

-          The base URL is ‘z3950.loc.gov:7090’.

-          The value of parameter ‘version’ is “1.2”.

-          The value of parameter ‘operation’ is ‘”searchRetrieve”.

-          The value of parameter ‘query’ is ”dinosaur”.

 

Then the URL would be:

http://z3950.loc.gov:7090/voyager?version=1.2&operation=searchRetrieve &query=dinosaur

 

And over the wire goes:

GET /voyager?version=1.2&operation=searchRetrieve&query=dinosaur HTTP/1.1
Host: z3950.loc.gov:7090

3      Encoding (Client Procedure)

The following encoding procedure is recommended, in particular, to accommodate Unicode characters (characters from the Universal Character Set, ISO 10646) beyond U+007F, which are not valid in a URI.

  1. Convert the value to UTF-8.
  2. Percent-encode characters as necessary within the value. See rfc 3986 section 2.1.
  3. Construct a URI from the parameter names and encoded values.

Note: In step 2, it is recommended to percent-encode every character in a value that is not in the URI unreserved set, that is, all except alphabetic characters, decimal digits, and the following four special characters: dash (-), period (.), underscore (_), tilde (~). By this procedure some characters may be percent-encoded that do not need to be -- For example '?' occurring in a value does not need to be percent encoded, but it is safe to do so.

4      Decoding (Server Procedure)

  1. Parse received request based on '?', '&', and '=' into component parts: the base URL, and parameter names and values.
  2. For each parameter:
    1. Decode all %-escapes.
    2. Treat the result as a UTF-8 string.

5      Example

Consider the following parameter:

query=dc.title =/word kirkegård

The name of the parameter is "query" and the value is "dc.title =/word kirkegård"

Note that the first '=' (following "query") must not be percent encoded as it is used as a URI delimiter; it is not part of a parameter name or value. The second '=' (preceding the '/') must be percent encoded as it is part of a value.

The following characters must be percent encoded:

The resulting parameter to be sent to the server would then be:

query=dc.title%20%3D%2Fword%20kirkeg%C3%A5rd

Acknowledgements

The following individuals have participated in the creation of this specification and are gratefully acknowledged:

Participants:

Kerry Blinco, Australian Department of Education, Employment and Workplace Relations

Ray Denenberg, Library of Congress

Larry Dixson, Library of Congress

Matthew Dovey, JISC

Janifer Gatenby, OCLC/PICS

Ralph LeVan, OCLC

Farrukh Najmi, Wellfleet Software Corporation

Ashley Sanders, University of Manchester

Rob Sanderson, University of Liverpool