Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make document-level annotations searchable #4906

Open
engisalor opened this issue Jun 25, 2024 · 0 comments
Open

Make document-level annotations searchable #4906

engisalor opened this issue Jun 25, 2024 · 0 comments

Comments

@engisalor
Copy link

Is your feature request related to a problem? Please describe.
Document-level annotations are not indexed/searchable, unlike other annotation types. For instance, in a project with a Sentence Label layer, the CQL search <Sentence_Label/> will find this type of annotation. In contrast, <Document_Label/> returns nothing for projects that use a Document Label layer.

Describe the solution you'd like
Users may need to find a document based on its metadata, either in the Annotation interface or in the Documents interface. CQL searches that can make use of document-level data, e.g., <Document_Label/>, would help users and admin find sets of related documents.

  1. The direct solution would be to index this annotation type just like the others.
  2. A related step would be to add CQL search functionality in the Documents interface. Currently there's a "Filter by name" search bar, which lets admin select and delete sets of documents, but CQL searching (if only for document-level annotations) would be more powerful.

Describe alternatives you've considered

  1. For the Annotation interface, document-level annotations can be added to an arbitrary token as a span-based layer, e.g., on the first token of every document. This can be a perfectly workable solution.
  2. For the Documents interface, if admin need to filter documents by document-level metadata, it can be included in the filename and searched via "Filter by name". This works but can lead to rather long filenames and duplicates document-level metadata (requiring one set in an annotation and another in the filename).

Additional context
Improving these features for document-level annotations would help integrate this annotation type into Inception, although (2) probably warrants a separate issue if pursued.

@reckart reckart self-assigned this Jul 8, 2024
@reckart reckart added this to the ⭐️ Feature backlog milestone Sep 22, 2024
@reckart reckart added this to Kanban Feb 20, 2025
@github-project-automation github-project-automation bot moved this to 🔖 To do in Kanban Feb 20, 2025
@reckart reckart removed their assignment Feb 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: 🔖 To do
Development

No branches or pull requests

2 participants