Wednesday 15 August 2012

streaming - Which is the best data store for generating realtime reports?




I wanted to know the best data store for storing and accessing concurrent data every few seconds from a source queue like Kafka or RabbitMQ. It should be scalable.

There are 2 approaches: (1) if you have lots of data (big data), and (2) if you have less data.

In the 2nd case, you can follow the traditional approach with a DB (relational or document, e.g. MongoDB or whatever; it depends on your data modeling needs).
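
For the smaller-data case, here is a minimal sketch of that traditional approach. It assumes Python's built-in sqlite3 module as a stand-in for whichever relational DB you pick. The events table, its columns, and the function names are made up for illustration, and the consumer that reads from Kafka or RabbitMQ is left out.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the store and create the (hypothetical) events table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        " ts INTEGER NOT NULL,"      # event time, epoch seconds
        " source TEXT NOT NULL,"     # which producer sent the event
        " value REAL NOT NULL)"
    )
    # Reports filter on time, so index the timestamp column.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_events_ts ON events (ts)")
    return conn


def store_batch(conn, rows):
    """Insert a batch of (ts, source, value) tuples in one transaction."""
    with conn:
        conn.executemany(
            "INSERT INTO events (ts, source, value) VALUES (?, ?, ?)", rows
        )


def report(conn, since_ts):
    """Return (source, event count, value sum) for events at or after since_ts."""
    cur = conn.execute(
        "SELECT source, COUNT(*), SUM(value) FROM events"
        " WHERE ts >= ? GROUP BY source ORDER BY source",
        (since_ts,),
    )
    return cur.fetchall()
```

Writing each drained batch in a single transaction keeps inserts cheap, and the report query can be re-run every few seconds against the indexed timestamp.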

In the 1st case, you can use Kafka to aggregate data from multiple sources and save it in HDFS. Then you can use Storm on Hadoop or Spark to execute analytical queries.
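
The shape of that pipeline can be sketched with the standard library only. This is a toy under stated assumptions, not a real deployment: queue.Queue stands in for a Kafka topic, JSON-lines files in a local directory stand in for HDFS, and a plain Python loop stands in for the Spark or Storm job. All function names and the "source" field are hypothetical.

```python
import json
import queue
from collections import Counter
from pathlib import Path


def drain(q, max_items):
    """Pull up to max_items messages off the queue without blocking."""
    batch = []
    while len(batch) < max_items:
        try:
            batch.append(q.get_nowait())
        except queue.Empty:
            break
    return batch


def write_batch(directory, batch_id, batch):
    """Write one batch as a JSON-lines file (stand-in for a file in HDFS)."""
    path = Path(directory) / f"batch-{batch_id:05d}.jsonl"
    with path.open("w", encoding="utf-8") as f:
        for msg in batch:
            f.write(json.dumps(msg) + "\n")
    return path


def count_by_source(directory):
    """Analytical query over all stored batches: number of messages per source."""
    counts = Counter()
    for path in sorted(Path(directory).glob("batch-*.jsonl")):
        with path.open(encoding="utf-8") as f:
            for line in f:
                counts[json.loads(line)["source"]] += 1
    return dict(counts)
```

The point of the split is that ingestion (drain and write) and analysis (the query over stored batches) are decoupled, so each side can be scaled separately.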

PS: Kafka is similar to RabbitMQ (and, more generally, JMS) but is better suited to big data.

streaming real-time bigdata datastore
