My favorites | Sign in
Project Logo
                
Search
for
Updated Sep 14, 2009 by burtonator
GetDeltaAPI  
Get new items from a specific API (by time)

getDelta

permalink.getDelta

Returns full HTML content for all sources in our index. This also includes our content extract support which allows our users to skip indexing chrome within HTML posts.

Each item is the full HTML of a permalink published by a site in our index.

Each time we visit a site we fingerprint URLs on the front page and only crawl new URLs. We also crawl every URL found via RSS and Atom feeds.

comment.getDelta

Comment API endpoint. Returns new comments in the blogosphere.

feed.getDelta

RSS API for finding new RSS posts in the blogosphere.

Additional Information

For additional information please see:

Parameters

after Returns posts created after a given timestamp

See GeneralParameters for additional documentation.

Example Response

<item>

<title>Gillmor Gang Digests Comcast/Plaxo Deal</title>

<link>http://www.techcrunch.com/2008/05/14/gillmor-gang-digests-comcastplaxo-deal/</link>
<guid>http://techcrunch.com/2008/05/14/gillmor-gang-digests-comcastplaxo-deal</guid>

<pubDate>Thu, 15 May 2008 05:35:33 GMT</pubDate>

<dc:source>http://www.techcrunch.com</dc:source>

<weblog:title>TechCrunch</weblog:title>
<weblog:description>
TechCrunch profiles the companies, products and events that are defining
and transforming the new web. TechCrunch is written by Michael Arrington.
</weblog:description>

<dc:lang>en</dc:lang>
<weblog:tier>0</weblog:tier>

<atom:author>
  <atom:name>Michael Arrington</atom:name>
  <atom:email></atom:email>
  <atom:link></atom:link>
</atom:author>

<weblog:indegree>10131</weblog:indegree>
<weblog:iranking>606</weblog:iranking>

<category>Company &amp;#038; Product Profiles</category>
<category>comcast</category>
<category>Plaxo</category>

<description>
...
</description>

<post:content_extract>
...
</post:content_extract>

<post:body>
...
</post:body>

<post:title>...</post:title>

<weblog:publisher_type>WEBLOG</weblog:publisher_type>

<atom:published></atom:published>
<post:date_found>2008-05-15T05:35:33Z</post:date_found>

<post:resource_guid>WhB3f4PRaxs</post:resource_guid>

</item>

From Spinn3r Reference Client

permalink.getDelta

PermalinkClient client = new PermalinkClient();
PermalinkConfig config = new PermalinkConfig();
config.setVendor( "XXXXX" );
client.setConfig( config );

while( client.fetch() ) {
    List<BaseItem> results = client.getResults();
}

Comment by jeremybubs, Jul 28, 2009

client.fetch() doesn't return a boolean, so something is wrong with the example java code.

Comment by jeremybubs, Jul 28, 2009

Perhaps it should be

while(client.hasMoreResults()) {
  client.fetch();
  List<BaseItem> results = client.getResults();
  //Process this subset of the results.
}

Sign in to add a comment
Hosted by Google Code