I have an XML like this:
<w:p> <w:r> <w:rPr /> <w:t> Description 1</w:t> </w:r> </w:p> <w:p> <w:r> <w:rPr /> <w:t>Checkbox 1</w:t> </w:r> <w:r> <w:fldChar w:fldCharType="begin"> <w:ffData> <w:name w:val="" /> <w:enabled /> <w:calcOnExit w:val="0" /> <w:checkBox> <w:sizeAuto /> <w:checked /> </w:checkBox> </w:ffData> </w:fldChar> </w:r> <w:r> <w:rPr /> <w:t> Checkbox 2</w:t> </w:r> <w:r> <w:fldChar w:fldCharType="begin"> <w:ffData> <w:name w:val="" /> <w:enabled /> <w:calcOnExit w:val="0" /> <w:checkBox> <w:sizeAuto /> </w:checkBox> </w:ffData> </w:fldChar> </w:r> </w:p> <w:p> <w:r> <w:rPr /> <w:t> Description 2</w:t> </w:r> </w:p> <w:p> <w:r> <w:rPr /> <w:t> Description 3</w:t> </w:r> </w:p> .....
On this XML I have couples of <w:p> </w:p>
There are some <w:p>
Description
tags that contains checkbox tag after them and some that are empty. For each I need to create a JSON object and store it in a list.
I need to find tags to take text inside <w:t>
and then to continue to another <w:p>
tag to see if it contains checkbox, if yes then to take <w:t>
value the JSON will look like this:
json['description'] = description json['checkbox_text'] = checkbox
else if the tag after Description
tag contain no checkbox then the JSON will contain only one element:
json['description'] = description
My code looks like this:
results = [] default_positions = [m.start() for m in re.finditer('w:p', xml_content)] jsonobj = {} for position in default_positions: if .. : //code else: //code
Any help?
No comments:
Post a Comment