01-25-10, 07:43 PM
Newbie Coder
Join Date: Jan 2010
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
php scraper help
Hi all, I am hoping someone can help me out. I downloaded a scraping script from the web to scrape a friend's MySpace page, so that they only have to maintain one calendar between the two sites. The problem is that I can't get the script to scrape the site. If anyone could help me, it would be great. I am including the script files as they are configured now.
Also, I am extremely glad to be here.
Thanks,
yonu
01-25-10, 09:39 PM
-
Join Date: Feb 2006
Posts: 2,515
Thanks: 20
Thanked 109 Times in 106 Posts
It might be easier to use the API
API - Facebook Developer Wiki
01-25-10, 09:42 PM
Newbie Coder
Join Date: Jan 2010
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
Not sure I understand. How will the Facebook API help me with MySpace?
01-28-10, 11:50 PM
Newbie Coder
Join Date: Jan 2010
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
Hey, thanks for the info. However, I have gotten nowhere with the APIs and OpenSocial. Would it be possible for someone to help me with the script I posted, or with how to get the API to pull what I need from the profile?
Thanks,
yonu
01-29-10, 04:16 PM
Aspiring Coder
Join Date: Mar 2009
Location: North Carolina, USA
Posts: 516
Thanks: 5
Thanked 47 Times in 44 Posts
In order to scrape a site, you need two things.
1. The code you're working with (which you included).
2. The HTML of the page you are trying to scrape.
01-29-10, 08:33 PM
Newbie Coder
Join Date: Jan 2010
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
If it helps, here is the section of the page I am trying to scrape.
I suppose I should have included it in my first post.
Thanks,
yonu
01-29-10, 09:15 PM
-
Join Date: Feb 2006
Posts: 2,515
Thanks: 20
Thanked 109 Times in 106 Posts
I don't think scraping the page is a good approach. That's why they build APIs.
Also, people are more likely to help if you post the code, rather than a file.
01-29-10, 09:29 PM
Newbie Coder
Join Date: Jan 2010
Posts: 10
Thanks: 0
Thanked 0 Times in 0 Posts
OK then, here is the code. As to the API, I was unable to find a method to extract the required data using an API.
Code:
<div id="profile_bandschedule">
<table bordercolor="#6699cc" cellspacing="0" cellpadding="0" width="440" bgcolor="#6699cc" border="0">
<tr>
<td>
<table width="440" border="0" cellspacing="0" cellpadding="0">
<tr>
<td bgcolor="#6699cc" class="text" align="left" style="WORD-WRAP:break-word"> <span class="whitetext12">Upcoming Shows</span></td>
<td align="right"><font color="#ffffff" size="2" face="Arial, Helvetica, sans-serif"><span align="right" class="whitelink"><font size="1">( <a href="http://collect.myspace.com/index.cfm?fuseaction=bandprofile.listAllShows&friendid=141122568&n=Keynote+Company+(NEW+BLOGS)" class="whitelink">view all</a> )</font></span></font></td>
</tr>
</table>
</td>
</tr>
<tr>
<td style="PADDING-RIGHT: 3px; PADDING-LEFT: 3px; PADDING-BOTTOM: 3px; PADDING-TOP: 3px">
<table width="440" border="0" cellspacing="2" cellpadding="2" bgcolor="#ffffff">
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Jan 30 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553492">PROMO - A Loss For Words, Lions Lions, And Then There Were None, Thieves and Villains ++</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">ORIENT HEIGHTS COMMUNITY CENTER - EAST BOSTON, MA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 9 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552863">World In Arms, Break Of Dawn, Scalpel, Destruction From Within, Vicitim of Circumstance ++</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 16 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38549920">I AM GHOST (EPITAPH), Modern Day Escape (Standby), The Becoming (Tooth and Nail), locals TBA</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 23 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552864">Lost in Layaway, Sleep City, June and the Ocean, Worth The Weight, Stealing Harvard + 1 TBA</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 27 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553562">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 2 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553415">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 7 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">6:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38555091">THE ATARIS !!!! w/ Don’t Panic, Half Hearted Hero, Foredoes Me Quite + The Intel</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">PAL HALL - FALL RIVER, MA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 14 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">2:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552040">TRANSIT (Run For Cover), The Stereo State, more TBA</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 21 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">2:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553218">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 27 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553442">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 4 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553441">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 9 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553820">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 19 2010</font></td>
<td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38555368">TBA</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
</tr>
</table>
</td>
</tr>
</table>
</div>
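The markup above has a regular shape: each show is a row whose innermost cells hold the date, the time, a linked title, and the venue, in that order. A minimal sketch of pulling those out follows, written in Python with only the standard library rather than PHP; the same idea should carry over to PHP's DOMDocument. The names `ScheduleParser` and `parse_shows` are made up for illustration, `SAMPLE` is two rows trimmed from the markup above, and the whole thing assumes the page keeps this layout, which a site redesign would break.

```python
from html.parser import HTMLParser


class ScheduleParser(HTMLParser):
    """Collect the text (and link, if any) of every <td> that has no nested table."""

    def __init__(self):
        super().__init__()
        self.cells = []    # (text, href) for each innermost cell, in page order
        self._stack = []   # one entry per currently open <td>

    def handle_starttag(self, tag, attrs):
        if tag == "td":
            self._stack.append({"text": [], "href": None, "leaf": True})
        elif tag == "table" and self._stack:
            # The enclosing cell is only a layout wrapper, not a data cell.
            self._stack[-1]["leaf"] = False
        elif tag == "a" and self._stack:
            self._stack[-1]["href"] = dict(attrs).get("href")

    def handle_endtag(self, tag):
        if tag == "td" and self._stack:
            cell = self._stack.pop()
            if cell["leaf"]:
                text = " ".join("".join(cell["text"]).split())
                self.cells.append((text, cell["href"]))

    def handle_data(self, data):
        if self._stack:
            self._stack[-1]["text"].append(data)


def parse_shows(html):
    """Return one dict per show, keyed off the Band_Show_ID link in the title cell."""
    parser = ScheduleParser()
    parser.feed(html)
    parser.close()
    cells = parser.cells
    shows = []
    for i, (text, href) in enumerate(cells):
        # Assumption: date and time cells precede the title, the venue follows it.
        if href and "Band_Show_ID" in href and 2 <= i < len(cells) - 1:
            shows.append({
                "date": cells[i - 2][0],
                "time": cells[i - 1][0],
                "title": text,
                "venue": cells[i + 1][0],
                "url": href,
            })
    return shows


# Two rows trimmed from the posted markup (font attributes shortened).
SAMPLE = """
<table width="440" border="0" cellspacing="2" cellpadding="2" bgcolor="#ffffff">
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1">Feb 9 2010</font></td>
<td width="35" align="right"><font size="1">7:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552863">World In Arms, Break Of Dawn, Scalpel, Destruction From Within, Vicitim of Circumstance ++</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1">CLUB HELL - PROVIDENCE, RI</font></td>
</tr>
<tr>
<td width="120" bgcolor="#b1DOfO">
<table width="120" border="0" cellspacing="2" cellpadding="0">
<tr>
<td width="85"><font size="1">Feb 27 2010</font></td>
<td width="35" align="right"><font size="1">8:00P</font></td>
</tr>
</table>
</td>
<td width="191" bgcolor="#d5e8fb"><font size="1"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553562">HOLD</a></font></td>
<td width="115" bgcolor="#d5e8fb"><font size="1">TBA</font></td>
</tr>
</table>
"""

shows = parse_shows(SAMPLE)
for show in shows:
    print(show["date"], show["time"], "|", show["title"], "|", show["venue"])
```

Keying on the `Band_Show_ID` link instead of on cell widths or colors means the "view all" header row is skipped without special handling, since its link points at a different page.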
01-30-10, 04:06 PM
-
Join Date: Feb 2006
Posts: 2,515
Thanks: 20
Thanked 109 Times in 106 Posts
Last edited by wirehopper; 01-30-10 at 04:12 PM .