Project

General

Profile

Actions

Bug #329

open

An efficient way to retain grammar elements data in DWD array, coordinates & tabular formats

Added by Joseph Potvin 10 months ago. Updated 10 months ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
Refine
Start date:
07/31/2025
Due date:
% Done:

0%

Estimated time:

Description

While comparing the four "Preview Lookup Equivalents" I noticed there is essential data missing from all except the JSON format. In the others I neglected to retain the grammatical sections of the sentences.

Please have the Table, the DWD Array and the DWD Coordinates store every sentence with JSON object segmentation, for example:

{"determiner":"This","noun":"banking institution or microfinance banking institution, or its controlling company","predicate_verb":"is liable to","attribute":"penalties ","description":"contemplated in section 94(1)","past_participle_verb":"as determined by the Central Bank."}

Since the natural language in this field needs to be parsed anyways, I think there's little lost and processing power gained by including this part of the JSON. Also it makes sense to borrow the JSON syntax rather than create a novel method, so that it is already rountine.


Files

clipboard-202507311423-zj34y.png View clipboard-202507311423-zj34y.png 58.8 KB Huda Hussain, 07/31/2025 06:23 PM
Actions #1

Updated by Joseph Potvin 10 months ago

Also, the order in which the elements have been arranged by the rule author must be retained, so that when someone is auditing the DWD Coordinate List, every sentence remains readable in the given language.

Actions #2

Updated by Huda Hussain 10 months ago

  • Status changed from New to Feedback
  • Assignee changed from Huda Hussain to Joseph Potvin

This should be done!

Actions #3

Updated by Huda Hussain 10 months ago

I realize that the JSON format requires double quotes to surround the values for example: {"determiner":"This"} or {"noun":"banking institution or microfinance banking institution, or its controlling company"}

However this will mess up the csv format:

Are you sure you want to display it in JSON format?

Actions #4

Updated by Joseph Potvin 10 months ago

  • Assignee changed from Joseph Potvin to Huda Hussain

Hmm, can you think of a workaround for JUST the csv format? Since one would not want JSON and CSV at the same time in any case.

Actions #5

Updated by Huda Hussain 10 months ago

I think I can come up with something...

Actions #6

Updated by Huda Hussain 10 months ago

  • Assignee changed from Huda Hussain to Joseph Potvin

If you double the double quotes then the formatting works, so that is what happens when you download a csv file.

{"determiner":"This"} -> " {""determiner"":""This""}"

Actions #7

Updated by Joseph Potvin 10 months ago

  • Status changed from Feedback to Resolved
  • Assignee changed from Joseph Potvin to Huda Hussain

Great -- yes I did test your CSV output several days ago and it loads well into a spreadsheet. Sorry, I forgot to say!

Actions #8

Updated by Huda Hussain 10 months ago

I am glad to hear it! Is this task completed? Apologies if I forgot something.

Actions

Also available in: Atom PDF