Explain what is Apache Storm? What are the components of Storm?
Apache storm is an open source distributed real-time computation system used for processing real time big data analytics. Unlike Hadoop batch processing, Apache storm does for real-time processing and can be used with any programming language.
Components of Apache Storm includes
Nimbus: It works as a Hadoop’s Job Tracker. It distributes code across the cluster, uploads computation for execution, allocate workers across the cluster and monitors computation and reallocates workers as needed
Zookeeper: It is used as a mediator for communication with the Storm Cluster
Supervisor: Interacts with Nimbus through Zookeeper, depending on the signals received from the Nimbus, it executes the process.
Does Apache act as a Proxy server?
Yes, It acts as proxy also by using the mod_proxy module.
Why Apache Storm is the first choice for Real Time Processing?
Easy to operate: Operating storm is quiet easy
Real fast: It can process 100 messages per second per node
Fault Tolerant: It detects the fault automatically and re-starts the functional attributes
Reliable: It guarantees that each unit of data will be executed at least once or exactly once
Scalable: It runs across a cluster of machine
What is multiviews?
A MultiViews search is enabled by the MultiViews Options.
If the server receives a request for /some/dir/foo and
/some/dir/foo does not exist, then the server reads the
directory looking for all files named foo.*, and effectively
fakes up a type map which names all those files, assigning
them the same media types and content-encodings it would
have if the client had asked for one of them by name. It
then chooses the best match to the client's requirements,
and returns that document.
Explain how data is stream flow in Apache Storm?
In Apache storm, data is stream flow with three components Spout, Bolt and Tuple