php scraper help

#1 | yonu | 01-25-10, 07:43 PM

Hi all. I am hoping someone can help me out. I downloaded a scraping script from the web to scrape a friend's MySpace page, so that they only have to maintain one calendar between the two sites. The problem is that I can't get the script to scrape the page. If anyone could help me, it would be great. I am including the script files as they are configured now.

Also, I am extremely glad to be here.

Thanks,
yonu
Attached Files: tmp.zip (1.9 KB)

#2 | wirehopper | 01-25-10, 09:39 PM
It might be easier to use the API

API - Facebook Developer Wiki

#3 | yonu | 01-25-10, 09:42 PM
Not sure I understand. How will the Facebook API help me with MySpace?

#4 | wirehopper | 01-26-10, 06:06 AM
Sorry about that. Regardless, most of these social networks have APIs so you can integrate their content into other systems.

Category:OpenSocial v0.9 REST Resources - MySpace Open Platform: Documentation Wiki

#5 | yonu | 01-28-10, 11:50 PM
Hey, thanks for the info. However, I have gotten nowhere with the APIs and OpenSocial. Would it be possible for someone to help me with the script I posted, or with how to get the API to pull what I need from the profile?

Thanks,
yonu

#6 | Jcbones | 01-29-10, 04:16 PM
In order to scrape a site, you need two things.

1. The code you're working with (which you included).
2. The HTML of the page you are trying to scrape.
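
The second item matters because the HTML a script receives may differ from what a browser shows (for example, a login or error page). A minimal sketch of a sanity check, assuming the fetched page is already in a string; the function name `has_block` is hypothetical and only PHP's bundled DOM extension is used:

```php
<?php
// Hypothetical check: does the fetched HTML contain an element with this id?
function has_block(string $html, string $id): bool
{
    // Nothing to parse, and XPath has no parameter binding, so only
    // accept ids made of letters, digits, underscores and hyphens.
    if ($html === '' || !preg_match('/^[A-Za-z0-9_-]+$/', $id)) {
        return false;
    }

    libxml_use_internal_errors(true);   // tolerate invalid real-world markup
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    return $xpath->query("//*[@id='$id']")->length > 0;
}
```

If a call such as `has_block($html, 'some-id')` returns false for the block you expect, the problem is probably in how the page is fetched rather than in the parsing code.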

#7 | yonu | 01-29-10, 08:33 PM
If it helps, here is the section of the page I am trying to scrape.
I suppose I should have included it in my first post.

Thanks,
yonu
Attached Files: source.txt (12.3 KB)

#8 | wirehopper | 01-29-10, 09:15 PM
I don't think scraping the page is a good approach. That's why they build APIs.

Also, people are more likely to help if you post the code, rather than a file.

#9 | yonu | 01-29-10, 09:29 PM
OK then, here is the code. As to the API, I was unable to find a method to extract the required data using an API.

Code:
<div id="profile_bandschedule">

<table bordercolor="#6699cc" cellspacing="0" cellpadding="0" width="440" bgcolor="#6699cc" border="0">
  <tr>
    <td>
      <table width="440" border="0" cellspacing="0" cellpadding="0">
        <tr>
          <td bgcolor="#6699cc" class="text" align="left" style="WORD-WRAP:break-word">&nbsp;&nbsp;&nbsp;<span class="whitetext12">Upcoming Shows</span></td>
          <td align="right"><font color="#ffffff" size="2" face="Arial, Helvetica, sans-serif"><span align="right" class="whitelink"><font size="1">( <a href="http://collect.myspace.com/index.cfm?fuseaction=bandprofile.listAllShows&friendid=141122568&n=Keynote+Company+(NEW+BLOGS)" class="whitelink">view all</a> )</font></span></font></td>

        </tr>
      </table>
    </td>
  </tr>
  <tr>
    <td style="PADDING-RIGHT: 3px; PADDING-LEFT: 3px; PADDING-BOTTOM: 3px; PADDING-TOP: 3px">
  
  
      <table width="440" border="0" cellspacing="2" cellpadding="2" bgcolor="#ffffff">
  
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Jan 30 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553492">PROMO - A Loss For Words, Lions Lions, And Then There Were None, Thieves and Villains ++</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">ORIENT HEIGHTS COMMUNITY CENTER - EAST BOSTON, MA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 9 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552863">World In Arms, Break Of Dawn, Scalpel, Destruction From Within, Vicitim of Circumstance ++</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 16 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38549920">I AM GHOST (EPITAPH), Modern Day Escape (Standby), The Becoming (Tooth and Nail), locals TBA</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 23 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552864">Lost in Layaway, Sleep City, June and the Ocean, Worth The Weight, Stealing Harvard + 1 TBA</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">CLUB HELL - PROVIDENCE, RI</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Feb 27 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553562">HOLD</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 2 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553415">HOLD</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 7 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">6:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38555091">THE ATARIS !!!!  w/ Don’t Panic, Half Hearted Hero, Foredoes Me Quite + The Intel</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">PAL HALL - FALL RIVER, MA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 14 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">2:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38552040">TRANSIT (Run For Cover), The Stereo State, more TBA</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 21 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">2:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553218">HOLD</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Mar 27 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553442">HOLD</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 4 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">8:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553441">HOLD</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">
            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 9 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">5:00P</font></td>

              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38553820">HOLD</a></font></td>
          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
        <tr>
          <td width="120" bgcolor="#b1DOfO">

            <table width="120" border="0" cellspacing="2" cellpadding="0">
              <tr>
                
                <td width="85"><font size="1" face="Arial, Helvetica, sans-serif">Apr 19 2010</font></td>
                
                <td width="35" align="right"><font size="1" face="Arial, Helvetica, sans-serif">7:00P</font></td>
              </tr>
            </table>
          </td>
          <td width="191" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif"><a href="http://music.myspace.com/index.cfm?fuseaction=music.showDetails&friendid=141122568&Band_Show_ID=38555368">TBA</a></font></td>

          <td width="115" bgcolor="#d5e8fb"><font size="1" face="Arial, Helvetica, sans-serif">TBA</font></td>
        </tr>
  
      </table>
      
    </td>
  </tr>
</table>
</div>
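
For reference, the rows in the markup above could be pulled out with PHP's bundled DOM extension (`DOMDocument` and `DOMXPath`) rather than with regular expressions. This is only a minimal sketch, not the script from tmp.zip. It assumes the HTML is already in a string, that the layout stays as posted (one row per show with three cells: a nested date/time table, a linked title, and a venue), and that times always end in `A` or `P`. The function name `parse_shows` and the array keys are hypothetical.

```php
<?php
// Hypothetical helper: extract shows from the "profile_bandschedule" block.
// Relies on the table layout posted above; if the markup changes, the
// XPath expressions need to change with it.
function parse_shows(string $html): array
{
    libxml_use_internal_errors(true);   // old, non-validating HTML
    $doc = new DOMDocument();
    // Prepend a charset hint so non-ASCII text is read as UTF-8 (assumption).
    $doc->loadHTML(
        '<meta http-equiv="Content-Type" content="text/html; charset=utf-8">' . $html
    );
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    // Show rows have three direct <td> children; the header row and the
    // nested date/time rows have two, so they are skipped.
    $rows = $xpath->query('//div[@id="profile_bandschedule"]//tr[count(td) = 3]');

    $shows = [];
    foreach ($rows as $row) {
        // The first cell holds a nested table: date in its 1st td, time in its 2nd.
        $date = $xpath->evaluate('normalize-space(td[1]//td[1])', $row);
        $time = $xpath->evaluate('normalize-space(td[1]//td[2])', $row);
        // "5:00P" becomes "5:00PM" so the 12-hour format can be parsed.
        $when = DateTime::createFromFormat('M j Y g:iA', "$date {$time}M");

        $shows[] = [
            'starts' => $when ? $when->format('Y-m-d H:i') : null,
            'title'  => $xpath->evaluate('normalize-space(td[2])', $row),
            'url'    => $xpath->evaluate('string(td[2]//a/@href)', $row),
            'venue'  => $xpath->evaluate('normalize-space(td[3])', $row),
        ];
    }
    return $shows;
}
```

With the attachment saved locally, something like `print_r(parse_shows(file_get_contents('source.txt')));` would list the shows. Fetching the live page is a separate step and is not covered here.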

#10 | wirehopper | 01-30-10, 04:06 PM

Last edited by wirehopper; 01-30-10 at 04:12 PM.

All times are GMT -5.