I have limited html and xml knowledge and I'm trying to scrape some URLs to obtain a block of text using =ImportXML() on Google Spreadsheets.
Here is the link: http://ift.tt/18zaZcA
<div id="abstractmore" style="">
<h2>ABSTRACT*</h2>
<p>
The project aims to...be responsible for the general management of the project.<a href="javascript:;" id="rdless" class="more"> Read Less»</a>
</p>
</div>
I am trying to extract the complete abstract. I used Chrome's inspect element tool and browsed through various tutorials...I was able to come up with these xpaths from what I've read:
//div[@id='abstractmore']/p/text()
//*[@id="abstractmore"]/p/text()
These are returning with error: Imported content is empty. I am completely lost as to how to figure out xpath!?
No comments:
Post a Comment