graysqwrl 0 Posted July 21, 2010 Report Share Posted July 21, 2010 Basically I'm trying to scrape all of the attributes of a real estate property and add it to a .csv file. I've never scraped before and I seem to be having problems with getting html tags stuck in all of my results. Basically... I want one column of the attribute names and one column of the numbers. So it would look like this: ASSD LAND:, $163,816ASSD IMP:, $35,100TOT VAL:, $198,916 etc.... <a href="http://www.themls.com/maps/MapPrintView.aspx?apn=5054003002" target="_blank">633/J6</a> </TR> <TR> <TD><B>ASSD LAND:</B> $163,816</TD> <TD><B>ASSD IMP:</B> $35,100</TD> <TD><B>TOT VAL:</B> $198,916</TD> </TR> <TR> <TD><B>ASSD YR:</B> 2008</TD> <TD><B>PROPERTY TAX:</B> $ 2,738.47</TD> <TD> </TD> </TR> <TR> <TD colspan="1" nowrap><B>OWNER NAME:</B>TIEGER TODD D</TD> <TD colspan="3"><B>OWNER ADD :</B> PO BOX 5882</TD> </TR> <TR> <TD><B>OWNER CITY:</B> PLAYA DEL REY </TD> <TD><B>OWNER STATE :</B> CA</TD> <TD><B>OWNER ZIP :</B> 90296</TD> </TR> <TR> <TD><B>OWNER PHONE :</B> N/A</TD> <TD><B>ABSENTEE OWNER :</B> No</TD> <TD> </TD> </TR> <TR> <TD colspan="2"><B>CO-OWNER NAME:</B> </TD> <TD> </TD> </TR> <TR> <TD><B>TRANS DATE: </B>01/05/2001</TD> <TD nowrap><B>PRIOR TRANS DATE:</B> 06/1990</TD> <TD> </TD> </TR> <TR> <TD><B>DOC NO.:</B> 27841</TD> <TD><B>PRIOR DOC NO.:</B> 1125030</TD> <TD> </TD> </TR> <TR> <TD><B>DOC TYPE:</B> GD</TD> <TD><B>PRIOR PRICE:</B> 0</TD> <TD> </TD> </TR> <TR> <TD><B>LV AREA: </B>4,712</TD> Can someone point me in the right direction? Thanks Quote Link to post Share on other sites
meter 145 Posted July 21, 2010 Report Share Posted July 21, 2010 This should scrape the first 3 values: ($163,816 $35,100 $198,916) http://img33.imageshack.us/img33/4122/43402642.jpg Quote Link to post Share on other sites
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.