LISTSERV mailing list manager LISTSERV 16.0

Help for ZNG Archives


ZNG Archives

ZNG Archives


[email protected]


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Monospaced Font

LISTSERV Archives

LISTSERV Archives

ZNG Home

ZNG Home

ZNG  September 2002

ZNG September 2002

Subject:

Re: Where Are We Now?

From:

Reply-To:

Z39.50 Next-Generation Initiative

Date:

Mon, 30 Sep 2002 11:01:42 +1000

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (102 lines)

On Fri, Sep 27, 2002 at 11:01:07AM -0400, LeVan,Ralph wrote:
> I am content with case-insensitive index names. I only suggested
> case-sensitivity because I'm in the middle of doing a bunch of XML stuff and
> that is all case-sensitive.

Does CQL allow Unicode in names? If so, there is a benefit in case
sensitive names. You just do an exact string comparison. No need to
decompose, map case, etc worrying about Unicode rules. Not really
stressed, just pointing out that with Unicode, case insensitive
checks is harder than good old ASCII.

> I believe we still need first-in-field and last-in-field indicators. I
> still like the caret ('^') and dollar-sign as new special characters at the
> beginning of words to do that.

If these operators map on to attributes, I would prefer to keep them
out of the string literals. That is, I believe '^' will map on to
the first-in-field attribute, not be included in the actual term.
My preference is to keep the text in the string literals exactly what is
sent to the server - no local processing required. If this is desirable,
then can anchors be turned into modifiers too? lanc/leftanchored/first/start,
ranc/rightanchored/last/end, lranc, etc. Or else something outside the
string literal?

    ( dc.title = ^ "hello there" ^ )

> Ralph

Also, Robert Sanderson wrote:
> how is
> indexset.index:token = "term"
> any less extensable than:
> indexset.index token "term"

Its not a silly idea at all. In fact, if you remove the ':' character
and allow words or a set of symbols as modifiers, then '=' just becomes
the attribute 'equals'. Most servers would default to this anyway
(or has it changed in AA?) It means queries could be written as

    dc.title stem = "hi there"
or
    dc.title stem "hi there"

The attribute list would be a little different (the first would include
the 'equals' attribute value whereas the second would not), but it does
give query writers very good control over the attribute lists, while
giving some flexibility of what individuals think is 'good style' when
writing queries.

Someone else asked what does the following mean?

    dc.title relevance > "a"

I have no idea. But is it necessary in the grammar to disallow such
constructs? The idea is to have a general way to form attribute lists.
Servers can choose to reject/munge/tweak/ignore whatever combinations they
do not support (or whatever SRW mandates about such behaviour). I would
prefer to keep CQL open to new attributes etc (I keep thinking about GEO),
so I think its out of scope to work out what attribute combinations don't
make sense.


Grammar thought: going back to the idea of what an attribute list is
in the first place, why not include dc.title as a modifier?

query-term = "(" modifier* string-literal ")"
modifier = index-set "." access-point dc.title, etc
         | named-modifier stem, relevance, etc
         | symbolic-modifier =, >, <, etc

I will admit it does allow some bizare looking queries, but it
would also allow omitting the index name for example simply,
and it would also allow nested attribute lists to be supported
(you include multiple access points, defining a attribute path
as supported in AA for nested attributes).

A more conservative grammar would probably be better. Eg: force
zero or more dc.title etc to be first, followed by zero or more
textual modifiers, followed by an optional (zero or more?) symbolic
modifier, followed by the search text. Some power, but more consistent
look to queries.

query-term =
    "(" access-points* named-modifiers* symbolic-modifier? string-literal ")"

This stops people writing
    (stem=dc.title "hi")

Alan

ps: Putting parenthesis around everything is pretty ugly in my opinion
too, and quite unusual for such grammars - unless you like Lisp.
I have not read the web site recently relying on this list for
people to raise things that get changed, so I had not realized the
meeting had gone this way.
--
Alan Kent (mailto:[log in to unmask], http://www.mds.rmit.edu.au/~ajk/)
Project: TeraText Technical Director (http://teratext.com.au) InQuirion Pty Ltd
Postal: Multimedia Database Systems, RMIT, GPO Box 2476V, Melbourne 3001.
Where: RMIT MDS, Bld 91, Level 3, 110 Victoria St, Carlton 3053, VIC Australia.
Phone: +61 3 9925 4114 Reception: +61 3 9925 4099 Fax: +61 3 9925 4098

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

July 2017
October 2016
July 2016
August 2014
February 2014
December 2013
November 2013
October 2013
February 2013
January 2013
October 2012
August 2012
April 2012
January 2012
October 2011
May 2011
April 2011
November 2010
October 2010
September 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
October 2009
September 2009
August 2009
July 2009
May 2009
April 2009
March 2009
February 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
January 2007
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
December 2005
November 2005
October 2005
September 2005
August 2005
July 2005
June 2005
May 2005
April 2005
March 2005
February 2005
January 2005
December 2004
November 2004
October 2004
September 2004
August 2004
July 2004
June 2004
May 2004
April 2004
March 2004
February 2004
January 2004
December 2003
November 2003
October 2003
September 2003
August 2003
July 2003
June 2003
May 2003
April 2003
March 2003
February 2003
January 2003
December 2002
November 2002
October 2002
September 2002
August 2002
July 2002
June 2002
May 2002
April 2002
March 2002
February 2002
January 2002
December 2001
November 2001
October 2001
September 2001
August 2001
July 2001

ATOM RSS1 RSS2



LISTSERV.LOC.GOV

CataList Email List Search Powered by the LISTSERV Email List Manager