Adam Green
Twitter API Consultant
adam@140dev.com
781-879-2960
@140dev

Overcoming 502 errrors while backfilling tweets

by Adam Green on February 4, 2011Tweet

in Rate Limits,Streaming API,Twitter Politics,Twitter Server Errors

I’m collecting all the tweets for possible 2012 candidates with the Streaming API, and I wanted to make sure I was getting every one of their tweets. I built a backfilling script to go through every tweet in each of these accounts, and add any that weren’t already in the database. This uses the /statuses/user_timeline call to get the past tweets. I ran into a problem with tons of 502 errors from the Twitter API, as many as one every two or three API calls.

Taylor Singletary on the Dev mailing list suggested dropping the count parameter to avoid timeout errors, and this has helped a lot. I was using a count of 200 tweets per call to keep the number of calls low. This gave me all the data in about 100 calls, but with the errors I wasn’t able to complete the process before hitting the rate limit. I tried dropping the count to 100, and this allowed the script to finish with a total of 298 calls.

So now I have the catch 22 of needing to do more API calls to avoid the errors that cause too many API calls. The only solution I see is to cut the count parameter to a level that is low enough to avoid errors, and then spread the backfilling out over multiple hours to stay within the rate limit.

I think the ultimate solution is to do a steady level of backfilling spread over the entire day. I haven’t had to do backfilling in the past, because I was treating the Streaming API tweet collection as a high volume sampling mechanism. As long as I got lots of tweets on a particular subject, it was good. Now that I want to maintain a database of every tweet made by the candidates I have to backfill to make sure nothing was missed by streaming. This seems to be necessary, since every time I run the backfill I get two to three tweets that didn’t get sent by streaming.

Tagged as: 2012 candidates

Previous post: Twitter Politics: Collecting tweets for potential 2012 candidates

Next post: Automatic restarting of stream connection

Make a smooth transition to API 1.1
with Adam Green’s first book on
Twitter API programming

Adam Green has been building custom Twitter applications for 5 years. Thousands of people have used his open source libraries, ebooks, and tutorials to build their own websites and online tools based on the Twitter API.

Now Green has harnessed his years of coding experience to present the secret formula for successful Twitter Engagement programming.

Collect. Identify. Engage.

Collect a complete engagement profile for the most influential accounts for any area of interest. These tweets, mentions, friends, and followers will be stored in a MySQL database on your own server.

Identify the users you need to interact with, including the most mentioned tweets and tags. The reporting system included as source code can easily be enhanced to select exactly the type of users you need to help spread your message.

Engage with these key accounts using automated tweets and DMs. Follow new accounts that have been selected based on your specific needs, and track these accounts to know when they follow back.

All of the PHP source code for this book is available as open source under the MIT license.

Free Javascript Ebook

Tutorials on jQuery and Ajax
Getting a user timeline with Javascript
Javascript coding with the Search API
Complete client app with tweet display

Free Twitter OAuth Ebook

Posting tweets through the API
Looking up account details for Twitter users
Converting Twitter API docs into working PHP code
API console for debugging API requests

Hire Adam Green to Develop Custom Twitter Solutions

Contact Adam to schedule an initial consultation
by phone at (781) 879-2960
or email at adam@140dev.com Follow @140dev
Learn More

Testimonials

“When I started investigating on how to build my Twitter based Site the Twitter API made my head spin. Sure it was powerful but where do I even begin? Then I found the 140dev Streaming API Framework and it made Twitter development a LOT easier! Without it I would probably spend hundreds of hours just on standard functions such as connecting to the Twitter API, managing data and displaying results. Thanks 140DEV for such a cool tool!"

- Paul Valkama, http://StreamClock.com

“The most important part of any technology effort is the nature of the people you work with. The people at 140dev are world class best. They have years of experience, a deep knowledge of Internet technologies, extensive familiarity of the Twitter API and a proven ability to deliver. They also work great with others. Frankly they have been so incredible I am amazed and really hope we can continue the relationship for a decade."

- Bob Gourley, http://twitchimp.com

“We worked with Adam and the 140dev team to build a Twitter business focusing on healthcare companies. 140dev brought unparalleled knowledge of Twitter mechanics, but also an excellent understanding of how Twitter fits into business strategy. They proved to be ideal development partners: committed to reaching our project’s goals, generous with sharing their knowledge, and dedicated to delivering beyond our expectations, despite working across continents and time zones. We now see them as an extended part of our business and hope to work with them in many future projects."

- Kevin Michels-Kim and Gerold Geis, http://starling140.com

140dev Google Groups

You can share your questions and experiences with the Twitter API on these discussion forums
140dev Streaming API Framework
Twitter API Tools
Twitter OAuth Discussion
Recent Blog Posts
Recent Twitter API Programming Tutorials

Javascript Programming for Twitter API 1.1

Single-user Twitter OAuth Programming

Identifying influential Twitter users

Aggregating tweets: Search API vs. Streaming API

Making aggregated tweets visible to Google for SEO

Advantages of a Twitter API database cache
Blog Categories