Yesterday’s streaming API post described a multiple-server model for handling high-rate tweet collection. Today I’d like to cover a different architecture that addresses this problem with a single server running multiple databases.
Let’s say you want to display tweets for the most active stocks each day. The streaming API lets you collect tweets for 400 keywords, or in this case, the 400 most active stock symbols. That produces a high flow rate and a large database, which is far more than you need to query if your site only displays tweets for 20 or 30 stocks at any one time.
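As a quick sketch, the streaming API’s track parameter takes a comma-separated keyword list capped at 400 terms. The helper below is hypothetical, and it assumes plain symbol text is what you track (the streaming API matches plain terms, so a `$` prefix is stripped here):

```python
# Hypothetical helper: build the streaming API "track" parameter from
# a ranked list of stock symbols, keeping only the first 400.
def build_track_param(symbols, limit=400):
    # Strip any "$" prefix so we track the plain symbol text.
    terms = [s.lstrip("$") for s in symbols[:limit]]
    return ",".join(terms)
```

You would pass the resulting string as the `track` value when opening the streaming connection.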
A solution is to store all the tweets, users and related data you receive for all 400 stocks in one database; we’ll call it tweet_collect. You can then create a separate database, called tweet_serve, and have your code copy just the tweets for active stocks into it as they arrive. Your website only needs to read from tweet_serve, which will be much smaller and therefore deliver query results faster.
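The routing logic can be sketched like this, using in-memory SQLite to stand in for the two databases. The table schema, column names, and `store_tweet` function are all illustrative assumptions, not part of the post’s design:

```python
import sqlite3

# Stand-ins for the two databases described above.
collect = sqlite3.connect(":memory:")  # tweet_collect: everything
serve = sqlite3.connect(":memory:")    # tweet_serve: active stocks only

for db in (collect, serve):
    db.execute("CREATE TABLE tweets (id INTEGER PRIMARY KEY, symbol TEXT, text TEXT)")

# The 20 or 30 stocks the site currently displays.
ACTIVE = {"AAPL", "GOOG"}

def store_tweet(tweet_id, symbol, text):
    # Every tweet lands in tweet_collect...
    collect.execute("INSERT INTO tweets VALUES (?, ?, ?)", (tweet_id, symbol, text))
    # ...but only tweets for active stocks are copied to tweet_serve.
    if symbol in ACTIVE:
        serve.execute("INSERT INTO tweets VALUES (?, ?, ?)", (tweet_id, symbol, text))
```

The website then queries only the `serve` connection, which stays small no matter how much history accumulates in `collect`.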
When a new stock becomes active, you will already have its tweets available in tweet_collect, so you can quickly copy its tweets to tweet_serve and be ready to display on the site. When the stock is no longer active, you can delete its data from tweet_serve.
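The activate/deactivate steps might look like the sketch below, again using in-memory SQLite and an assumed schema; the function names are mine, not from the post:

```python
import sqlite3

# Stand-ins for tweet_collect and tweet_serve.
collect = sqlite3.connect(":memory:")
serve = sqlite3.connect(":memory:")
for db in (collect, serve):
    db.execute("CREATE TABLE tweets (id INTEGER PRIMARY KEY, symbol TEXT, text TEXT)")

def activate(symbol):
    # Backfill tweet_serve with the history already sitting in tweet_collect.
    rows = collect.execute(
        "SELECT id, symbol, text FROM tweets WHERE symbol = ?", (symbol,)
    ).fetchall()
    serve.executemany("INSERT INTO tweets VALUES (?, ?, ?)", rows)

def deactivate(symbol):
    # Drop the stock's rows once the site no longer displays it.
    serve.execute("DELETE FROM tweets WHERE symbol = ?", (symbol,))
```

Because the backfill reads from data you already collected, a newly active stock can appear on the site immediately rather than waiting for new tweets to arrive.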
The limitation of this technique is that you are restricted to topics that can be covered adequately within the 400-keyword limit. As long as this fits your application needs, this model will produce a much faster website display.
When a keyword becomes active that isn’t in your normal collection list, you can fill in its data with the search API as needed. Search isn’t as powerful as streaming for large amounts of data, but if you need ad hoc collection of tweets for a few extra keywords, it does a good job. You can query it up to 720 times an hour and request tweets for about 10 to 15 keywords each time. These tweets would also go into the tweet_serve database.
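A small sketch of how those numbers work out: the search API’s OR operator lets you join several keywords into one query, and 720 requests per hour is one request every 5 seconds. The batching helper and batch size below are illustrative assumptions:

```python
# Hypothetical helper: group extra keywords into search queries of
# roughly 10-15 terms each, joined with the search API's OR operator.
def batch_queries(keywords, per_query=12):
    batches = [keywords[i:i + per_query] for i in range(0, len(keywords), per_query)]
    return [" OR ".join(b) for b in batches]

# 720 requests/hour budget -> pause this long between search calls.
SECONDS_BETWEEN_REQUESTS = 3600 / 720  # 5 seconds
```

Each returned query string would be sent to the search API on that schedule, and the resulting tweets written into tweet_serve alongside the streamed ones.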