GetDeltaAPI
Get new items from a specific API (by time)
getDelta

permalink.getDelta
Returns full HTML content for all sources in our index. This also includes our content extract support, which allows our users to skip indexing chrome within HTML posts. Each item is the full HTML of a permalink published by a site in our index. Each time we visit a site we fingerprint URLs on the front page and only crawl new URLs. We also crawl every URL found via RSS and Atom feeds.

comment.getDelta
Comment API endpoint. Returns new comments in the blogosphere.

feed.getDelta
RSS API for finding new RSS posts in the blogosphere.

Additional Information
For additional information please see:

Parameters
See GeneralParameters for additional documentation.

Example Response
<item>
  <title>Gillmor Gang Digests Comcast/Plaxo Deal</title>
  <link>http://www.techcrunch.com/2008/05/14/gillmor-gang-digests-comcastplaxo-deal/</link>
  <guid>http://techcrunch.com/2008/05/14/gillmor-gang-digests-comcastplaxo-deal</guid>
  <pubDate>Thu, 15 May 2008 05:35:33 GMT</pubDate>
  <dc:source>http://www.techcrunch.com</dc:source>
  <weblog:title>TechCrunch</weblog:title>
  <weblog:description>
    TechCrunch profiles the companies, products and events that are defining and transforming the new web. TechCrunch is written by Michael Arrington.
  </weblog:description>
  <dc:lang>en</dc:lang>
  <weblog:tier>0</weblog:tier>
  <atom:author>
    <atom:name>Michael Arrington</atom:name>
    <atom:email></atom:email>
    <atom:link></atom:link>
  </atom:author>
  <weblog:indegree>10131</weblog:indegree>
  <weblog:iranking>606</weblog:iranking>
  <category>Company &#038; Product Profiles</category>
  <category>comcast</category>
  <category>Plaxo</category>
  <description> ... </description>
  <post:content_extract> ... </post:content_extract>
  <post:body> ... </post:body>
  <post:title>...</post:title>
  <weblog:publisher_type>WEBLOG</weblog:publisher_type>
  <atom:published></atom:published>
  <post:date_found>2008-05-15T05:35:33Z</post:date_found>
  <post:resource_guid>WhB3f4PRaxs</post:resource_guid>
</item>

From Spinn3r Reference Client
permalink.getDelta
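// Create the permalink client and configure it with your vendor key.
PermalinkClient client = new PermalinkClient();
PermalinkConfig config = new PermalinkConfig();
config.setVendor( "XXXXX" );
client.setConfig( config );

// Fetch results repeatedly; each pass returns the next batch of items.
while( client.fetch() ) {
    List<BaseItem> results = client.getResults();
    // Process this batch of results.
}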
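If you consume the raw XML rather than the reference client, the fields shown in the example response above can be read with the JDK's built-in DOM parser. The following is a minimal sketch, not part of the Spinn3r client: the class name `ItemFieldExtractor`, the `extract` method, and the choice of fields are assumptions made for illustration. It also assumes each item arrives as a standalone `<item>` fragment with no `xmlns` declarations, as in the example.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of the Spinn3r reference client.
final class ItemFieldExtractor {

    // Element names copied from the example response; adjust to the fields you need.
    static final String[] FIELDS = {
        "title", "link", "guid", "pubDate", "dc:lang",
        "weblog:title", "weblog:tier", "weblog:indegree",
        "weblog:publisher_type", "post:date_found", "post:resource_guid"
    };

    // Returns the text of the first occurrence of each known field found in the item.
    static Map<String, String> extract(String itemXml) throws Exception {
        // The factory is not namespace-aware by default, so prefixed names such as
        // "weblog:title" are matched as plain element names and the fragment
        // does not need xmlns declarations.
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Refuse DOCTYPE declarations, since the XML comes from the network.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(itemXml)));

        Map<String, String> fields = new LinkedHashMap<>();
        for (String name : FIELDS) {
            NodeList nodes = doc.getElementsByTagName(name);
            if (nodes.getLength() > 0) {
                fields.put(name, nodes.item(0).getTextContent().trim());
            }
        }
        return fields;
    }

    public static void main(String[] args) throws Exception {
        // Shortened copy of the example response item.
        String sample =
            "<item>"
            + "<title>Gillmor Gang Digests Comcast/Plaxo Deal</title>"
            + "<dc:lang>en</dc:lang>"
            + "<weblog:title>TechCrunch</weblog:title>"
            + "<weblog:indegree>10131</weblog:indegree>"
            + "<post:date_found>2008-05-15T05:35:33Z</post:date_found>"
            + "<post:resource_guid>WhB3f4PRaxs</post:resource_guid>"
            + "</item>";
        extract(sample).forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```

Fields that are absent from an item are simply left out of the returned map, so callers should check for a key before using it. Because the parser is not namespace-aware, `title`, `weblog:title`, and `post:title` are treated as three distinct element names.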
client.fetch() doesn't return a boolean, so something is wrong with the example Java code.
Perhaps it should be:
while( client.hasMoreResults() ) {
    client.fetch();
    List<BaseItem> results = client.getResults();
    // Process this subset of the results.
}