Adam Green
Twitter API Consultant
adam@140dev.com
781-879-2960
@140dev

Aggregating tweets: Search API vs. Streaming API

As part of upgrading the 140dev.com site for API 1.1, this page has replaced the old posting on this subject.

My attitude towards the search API has changed since the early days when Twitter bought it from Summize. I had real doubts back then as to Twitter’s ability and inclination to integrate the search API correctly. I was wrong. Search has been fixed, strengthened and integrated. Both search and streaming APIs are now essential parts of Twitter programming for Tweet collection. It is still useful to lay out their differences side by side. It isn’t a matter of using one or the other. Now you need to know when to use each API for maximum efficiency.

Past vs. Future
This is the essential difference between the two APIs. Search goes back in time and streaming goes forward. When you first decide to collect tweets on a subject, you have nothing to start with. The search API is best used then to fill in the past 7 days for your subject matter. This is often called back-filling. With a database caught up to the present moment, you can then turn on the streaming API and capture all tweets going forward.

Both need OAuth to connect to Twitter
The search API used to be limited just by IP address with no login required, but since version 1.1 was released, you have to log in with OAuth for all requests, including search and streaming. If you never learned how to use OAuth, this free e-book will get you started.

Rate limit rules are completely different
There is a subtle, yet very powerful difference between search and streaming API when it comes to OAuth tokens. You are only allowed to make a single streaming API OAuth connection for each twitter account that owns the app. No matter how many people log into your site, the app they all log into can only use a single streaming API connection. The search API, on the other hand, allows a separate rate limited bucket of requests for each user who logs into your app. There are many implications of this difference, but I can’t digress to cover them all here. The takeaway is that you have to plan your rate limit utilization with these two sets of limits in mind. You might find it more effective to use both APIs for different portions of your collection process.

Data formats are almost the same
I would never say that the search and streaming API return data in exactly the same formats, but the differences are small enough to not matter. You might have to tweak the collection scripts for each API, but the dependencies are isolated in those 2 scripts. The key is that the data each API returns is the same, even when their JSON return structures aren’t. You can safely mix data from both search and streaming into the same database. After that is done it doesn’t matter where the data originally came from.

Search API has more powerful queries
The search API has a fairly rich set of operators that can filter results based on attributes like location of sender, language, and various popularity measurements. The streaming API has a more limited approach of only collecting tweets containing words, sent by specific accounts, or within a geographic area.

Seach API can collect a wider range of data
The targets for tweet collection vary in several ways. The streaming API can collect all tweets that contain up to 400 keyword phrases, were sent by up to 5,000 accounts, and originated in up to 25 geographic areas. The exact limit on search API queries aren’t documented, but it is a good estimate that a query cannot contain greater than 15-20 keywords. On the other hand, you can make up to 15 search API requests a minute. That works out to about 250 keywords being searched each minute, or 15,000 keywords an hour. It is possible to switch the streaming keywords, but not at as high a rate as search.

Streaming API usually returns a much higher flow of tweets
Another limit that isn’t documented is the total flow from the streaming API. The docs say up to 1% of the full firehose of tweets. I’ve found that the streaming API has maxed out at around 3,000 tweets a minute, although that may have changed. This delivers a maximum flow of 180,000 tweets an hour. The search API returns up to 100 tweets per search and allows 720 requests per hour, giving us a max of 72,000 tweets per hour. On the other, other hand, if each user who logs into your app asks you to make search requests, then you can get up to 72,000 tweets per hour for every user.

You can see that this comparison is not as easy as it once was. If you need to squeeze out the maximum results from Twitter, you need to juggle the various factors to get the best combination of both search and streaming API calls. In the simplest case where you have a relatively fixed set of keywords, you should first run a search to collect the old tweets going back a week or so. Then turn on the streaming API for the same keywords.

If this is a new type of programming for you, check out my free library for streaming API tweet collection, and the examples of source code for searching on this site.

Related Twitter API Programming Tutorials

Make a smooth transition to API 1.1
with Adam Green’s first book on
Twitter API programming

Adam Green has been building custom Twitter applications for 5 years. Thousands of people have used his open source libraries, ebooks, and tutorials to build their own websites and online tools based on the Twitter API.

Now Green has harnessed his years of coding experience to present the secret formula for successful Twitter Engagement programming.

Collect. Identify. Engage.

Collect a complete engagement profile for the most influential accounts for any area of interest. These tweets, mentions, friends, and followers will be stored in a MySQL database on your own server.

Identify the users you need to interact with, including the most mentioned tweets and tags. The reporting system included as source code can easily be enhanced to select exactly the type of users you need to help spread your message.

Engage with these key accounts using automated tweets and DMs. Follow new accounts that have been selected based on your specific needs, and track these accounts to know when they follow back.

All of the PHP source code for this book is available as open source under the MIT license.

Free Javascript Ebook

Tutorials on jQuery and Ajax
Getting a user timeline with Javascript
Javascript coding with the Search API
Complete client app with tweet display

Free Twitter OAuth Ebook

Posting tweets through the API
Looking up account details for Twitter users
Converting Twitter API docs into working PHP code
API console for debugging API requests

Hire Adam Green to Develop Custom Twitter Solutions

Contact Adam to schedule an initial consultation
by phone at (781) 879-2960
or email at adam@140dev.com Follow @140dev
Learn More

Testimonials

“When I started investigating on how to build my Twitter based Site the Twitter API made my head spin. Sure it was powerful but where do I even begin? Then I found the 140dev Streaming API Framework and it made Twitter development a LOT easier! Without it I would probably spend hundreds of hours just on standard functions such as connecting to the Twitter API, managing data and displaying results. Thanks 140DEV for such a cool tool!"

- Paul Valkama, http://StreamClock.com

“The most important part of any technology effort is the nature of the people you work with. The people at 140dev are world class best. They have years of experience, a deep knowledge of Internet technologies, extensive familiarity of the Twitter API and a proven ability to deliver. They also work great with others. Frankly they have been so incredible I am amazed and really hope we can continue the relationship for a decade."

- Bob Gourley, http://twitchimp.com

“We worked with Adam and the 140dev team to build a Twitter business focusing on healthcare companies. 140dev brought unparalleled knowledge of Twitter mechanics, but also an excellent understanding of how Twitter fits into business strategy. They proved to be ideal development partners: committed to reaching our project’s goals, generous with sharing their knowledge, and dedicated to delivering beyond our expectations, despite working across continents and time zones. We now see them as an extended part of our business and hope to work with them in many future projects."

- Kevin Michels-Kim and Gerold Geis, http://starling140.com

140dev Google Groups

You can share your questions and experiences with the Twitter API on these discussion forums
140dev Streaming API Framework
Twitter API Tools
Twitter OAuth Discussion
Recent Blog Posts
Recent Twitter API Programming Tutorials

Javascript Programming for Twitter API 1.1

Single-user Twitter OAuth Programming

Identifying influential Twitter users

Aggregating tweets: Search API vs. Streaming API

Making aggregated tweets visible to Google for SEO

Advantages of a Twitter API database cache
Blog Categories

Aggregating tweets: Search API vs. Streaming API

Related Twitter API Programming Tutorials

Hire Adam Green to Develop Custom Twitter Solutions

Testimonials

140dev Google Groups

Recent Blog Posts

Recent Twitter API Programming Tutorials

Blog Categories