Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
The migration is almost done, at least the rest should happen in the background. There are still a few technical difference between the old cluster and the new ones, and they are summarized in this issue. Please pay attention to the TL:DR at the end of the comment.
Searching could be extended with and/or operations and more complex match
terms, like "field X contains/starts-with/ends-with Y".
[...]
Need full specification for the desired feature.
Here's a proposal for extended searching. This supersedes the current specification in a way which keeps all old searches working:
Searching
Supported searches:
[ ] - An empty list matches all contacts.[ [ `<search>` ], [ 'limit', `<number>` ] ] - a 'limit' searchterm with a number as parameter (formatted as string) can be addedat the top level term to truncate the search resultsafter a certain number of contacts. Example: Search([['any-contains','Joe'], ['limit', '10']]) => return the first 10 Joes.As with any other search, the resulting view will be updated ifcontact data changes.The limit must not be changed in a RefineSearch(). A 'limit' termmay (but doesn't have to) be given. If it is given, its value mustmatch the value set when creating the search. This limitationsimplifies the implementation and its testing. The limitationcould be removed if there is sufficient demand.[ [ `<search>` ] ] - the same as [ `<search>` ][ 'or', `<search1>`, `<search2>`, ... ] - combines 0 to n othersearches and results in a match if any of the sub-searchesmatches. ['or'] without any sub-search does not match.[ 'and', `<search1>`, `<search2>`, ... ] - like 'or', but matches ifand only if all of the sub-searches match.[ 'phone', '`<number>`' ] - Look up a valid phone number (= "caller ID").The country code for the current locale is added if no countrycode was given in the number. Phone numbers in the unified addressbook must start with the resulting full number, after being normalizedthe same way.In other words:- Formatting does not matter.- Alpha characters are aliases for numbers on the keypad and match their corresponding number.- Additional digits in the address book are ignored, only the prefix must match (extensions may or may not be included in `<number>`).- Phone numbers in the address book which cannot be normalized cannot be matched.[ 'is|contains|begins_with|ends_with', '`<field>`', '`<text>`','`<flags>`' ] - compares a specified field against the searchtext. For the 'is' operation, the entire field must match, for'contains' anywhere inside the value, for 'begins_with' at thebeginning and for 'ends_with' at the end.Fields are referenced as in the contact dictionary (see below), usingmultiple path components if necessary. Supported for matching are:'full-name' - string'nickname' - string'structured-name/family' - string'structured-name/given' - string'structured-name/additional' - string'phones/value' - telephone number'emails/value' - string'addresses/po-box' - string'addresses/extension' - string'addresses/street' - string'addresses/locality' - string'addresses/region' - string'addresses/postal-code' - string'addresses/country' - stringThe fields referencing value lists ('phones', 'email', 'address')check against any of the entries in these lists.Except for 'phones/value', all values are treated as text values.For text values, the default search without explicit flags iscase-insensitive and accent-sensitive. Spaces between wordsmatter. This behavior can be modified by giving additional,optional flags after the search value:'case-insensitive' - force case-insensitive search (available for the sakeof consistency and just in case, should the default ever change)'case-sensitive' - force case-sensitive searchFor telephone numbers, only digits are compared. Latin alphabeticcharacters are treated as aliases for digits as they typicallyoccur on a keypad or old rotary dial phones ('A', 'b', 'c' map to'1', etc.).If the full name was not set explicitly for a contact, theconcatenation of the given, middle and family with a space asseparator is used instead when matching against the 'full-name'field.Using the current syntax it is not possible to define searcheswhere the *same* value must meet different criteria ("cell phonenumber containing the digits 1234"). Something like that could beadded as a future extensions, for example by allowing searchvalues to have more complex types than the simple '`<text>`'. termwith a more complex type.[ 'any-contains', '`<text>`', `<flags>` ] - Sub-string search for`<text>` in the following contact values: first, middle or lastname, formatted name, nick name, phone number, or emailaddress. Optional flags include: 'case-insensitive' (the default),'case-sensitive'.This search is equivalent to:[ 'or', [ 'contains', 'structured-name/given', '`<text>`', `<flags>` ], [ 'contains', 'structured-name/additional', '`<text>`', `<flags>` ], [ 'contains', 'structured-name/family', '`<text>`', `<flags>` ], [ 'contains', 'full-name', '`<text>`', `<flags>` ], [ 'contains', 'emails/value', '`<text>`', `<flags>` ], [ 'contains', 'phones/value', '`<text>`']]
Note that lookup and search are different: the former is based on a
valid number, the later on user input.
A 'phone' lookup can compare normalized numbers including the country
code, to ensure that the lookup is exact and does not mismatch numbers
from different countries. Heuristics like suffix matching do not do
this correctly in all cases.
An 'any-contains' search is based on user input, which might contain
just some digits in the middle of the phone number. The search ignores
formatting in both input and address book.
Compound searches with 'and' and 'or' are evaluated lazily, from the
first to the last sub-search. Therefore it makes sense to list
sub-searches that are more likely to match first.
This new extended searching criteria are ok.
Just one note regarded:
- "is not possible to define searches where
the same value must meet different criteria"
This meas that a query like this is not allowed?
This new extended searching criteria are ok.
Just one note regarded:
- "is not possible to define searches where
the same value must meet different criteria"
This meas that a query like this is not allowed?
This new extended searching criteria are ok.
Just one note regarded:
- "is not possible to define searches where
the same value must meet different criteria"
This meas that a query like this is not allowed?
That query itself is fine and will be supported. It works as intended because there is only one structured-name.
The comment was about value lists, like telephone numbers. For example, suppose you have two telephone numbers in the same contact:
TEL:1234
TEL:5678
Now you search
['and',
['contains', 'phones/value', '1'],
['contains', 'phones/value', '5']
]
This will match the contact, because each of the terms combined with 'and' has a match in the contact: the first term will match '1234' and the second '5678'. This follows from the strict mathematic definition of the operations, but is admittedly not immediately obvious.
I think it is a corner case. It occurred to me when documenting the semantic and I wanted to write it down. Obviously, better documentation would not have triggered a question ;-} Please suggest a better wording.
Sort status update - recursive queries and "or"/"and" are working.
Using the current syntax it is not possible to define searcheswhere the *same* value must meet different criteria ("cell phonenumber containing the digits 1234"). Something like that could beadded as a future extensions, for example by allowing searchvalues to have more complex types than the simple '`<text>`'. termwith a more complex type.
What I thought I had typed is:
Using the current syntax it is not possible to define searches
where the same value in a value list ...
^^^^^^^^^^^^^^^^^
"term with a more complex type." at the end needs to be removed.
I noticed an inconsistency: the operations should use hyphen instead of underscore, like the other string constants. I noticed because I mistyped them in the tests at first.
I'll make it consistent, so now the operations are:
PIM testing: test field testsdoFilter() gets extended to take (`<full name>`, `<vcard>`) tuples inaddition to the full name alone. Then this is used to create one largevCard that is suitable for testing (field content unique, all fieldsset, etc.) with various filters. All field tests are covered with atleast one positive and one negative case.
PIM: implement 'is/contains/begins-with/ends-with'The operation is a runtime parameter of different classes, whereasextracting the right values to compare via the operation is hard-codedat compile time. This is a rather arbitrary compromise between codeduplication, simplicity and performance (which, in fact, was notmeasured at all).The code for selecting case-sensitivity and the normalization beforethe string operations is shared with the older 'any-contains'operation.
PIM testing: test case for 'and' and 'or'The new TestContacts.testFilterLogic uses the same infrastructure asthe language tests and covers some combinations of 'and' and 'or'.In some cases, Python lists had to be avoided in favor of tuples,because Python cannot automatically map the lists to a 'av' array ofvariants, while sending a tuple enumerating its types can be sent andgets accepted by PIM Manager.
PIM: support recursive search filterThis changes the signature of the filter parameter in Search(),RefineSearch() and ReplaceSearch() from 'as' (array of strings) to'av' (array of variants). Allowed entries in the variant are arrayscontaining strings and/or other such arrays (recursive!).A single string as value is not supported, which is the reason why'av' instead of just plain 'v' makes sense (mismatches can already befound in the sender, potentially at compile time). It also helpsPython choose the right type when asked to send an empty list. Whenjust using 'v', Python cannot decide automatically.Error messages include a backtrace of all terms that the current,faulty term was included in. That helps to locate the error in apotentially larger filter.The scope of supported searches has not changed (yet).
GDBus GIO: support recursive variant with one typeThe signature of the new type is just a 'v' for variant. Recursion isonly supported in combination with std::vector. This is done to keepimplementation simpler. std::vector was chosen over other collectionsbecause it is easier to work with in method calls, the main purposeof the new type.The conversion from D-Bus messages accepts anything which can be mappedinto the variant: arrays, tuples, plain types, and of course variantscontaining those. This helps with backward compatibility (one canturn an interface which took a fixed type like std::vector<std::string>into something with a recursive variant) and with Python, becausePython sends arrays and tuples without converting into variantswhen the app uses those simpler types.One caveat exists with sending an empty list in Python: when using 'v'as signature in the interface, Python doesn't know how to send theempty list without assistance by the programmer (as in dbus.Array([],signature='s'). If possible, avoid the plain 'v' in an interface infavor of something like 'av' (= std::vector<boost::variant>).
This should resolve enhanced searching. Enhanced sorting is still open, but less important and still undefined -> lowering priority and deferring it.