Parsing using lxml and requests with python



Recently I was trying to parse html table from web page using lxml and requests the python code runs like this



>>>from lxml to html
>>>import requests
>>>page=requests.get('http://ift.tt/1KbRANb evening-star-candlestick-pattern')
>>>tree=html.fromstring(page.text)'


Then I would like to parse the following repetitive data block using lxml xpath function to get lists



<TR>
<TD style="font-size: 11px;"><!-- <a href="/company-technical-details/<%=sr.getExchange()%>/<%=sr.getSymbol()%>/<%=sr.getName()%>" ><%= sr.getSymbol() %></a> -->
AMTEKINDIA </TD>
<TD style="font-size: 11px; max-width: 135px;">AMTEK INDIA LIMITED</TD>
<TD> nse </TD>
<TD style="min-width: 60px; max-width: 60px;">02-01-2015</TD>
<TD>78</TD>
<TD>78.3</TD>
<TD>72.25</TD>
<TD>73.9</TD>


But unable to do so getting error Eg:



>>> symbol=tree.xpath('//TD[@style="font-size: 11px;"][@!-- [@a href="/company-t
echnical-details/[@%=sr.getExchange()%]/[@%=sr.getSymbol()%]/[@%=sr.getName()%]"
][@%= sr.getSymbol() %][@/a] --]/text()')


giving Xpath eval error and



>>> prices=tree.xpath('//TD/text()')


returning list with no values Please help with the xpath function


No comments:

Post a Comment