The practice of syncing data in batches has given way to more frequent updates and real-time streaming of data. The interconnected world of web and mobile apps, distributed sensor networks and cloud computing clusters requires a new breed of data capture and analytics infrastructure that can handle the increasing volume and velocity of data.
The best way to get big data flowing in real time is with middleware that takes care of message queuing and delivery, so publishing applications and sensors can send data without worrying about where it needs to go or how it needs to get there. This entails the establishment and management of topics and queues, dynamic routing rules, and the intelligent handling of fault conditions (such as applications or network links being down or slow).
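The decoupling described above can be illustrated with a minimal in-memory sketch in Python. This is not Solace's API: the `Broker` class, its methods and the topic name are hypothetical, and a real broker would add persistence, network transport and fault handling.

```python
from collections import defaultdict, deque


class Broker:
    """Toy in-memory broker: publishers name a topic, never a consumer."""

    def __init__(self):
        # topic -> list of subscriber queues (hypothetical structure)
        self._subscriptions = defaultdict(list)

    def subscribe(self, topic):
        """Create a queue bound to a topic and return it to the consumer."""
        queue = deque()
        self._subscriptions[topic].append(queue)
        return queue

    def publish(self, topic, message):
        """Copy the message to every queue bound to the topic.

        Returns the number of queues that received it.
        """
        queues = self._subscriptions.get(topic, [])
        for queue in queues:
            queue.append(message)
        return len(queues)


broker = Broker()
analytics = broker.subscribe("sensors/temperature")
archive = broker.subscribe("sensors/temperature")

# The publisher only knows the topic, not who is listening.
delivered = broker.publish("sensors/temperature", {"id": "s1", "celsius": 21.5})
```

Because each subscriber has its own queue, a slow or offline consumer can drain its backlog later without holding up the publisher or the other consumers.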
Solace’s performance advantage and intelligent routing protocols enable a number of capabilities that make Solace’s solution uniquely well-suited to meeting the needs of big data capture, processing and distribution:
- Sharding: Sophisticated routing rules allow for guaranteed 1 of N delivery or more complicated multi-mode or high fan-out delivery of data.
- Sequencing: Solace can guarantee the sequential delivery of data elements across many downstream storage nodes and applications.
- De-duping: Solace enables the real-time de-duplication of data through the use of last value queues, sequence number generation and content-based selectors.
- Affinity Based Routing: Captured data elements need to be routed to the right storage location, and subsequent updates need to have affinity to that location. Solace uses dynamic address space and internode routing protocols to make sure every data element is routed appropriately.
- Data Buffering: Incoming data can spike to rates that far exceed the average inbound rate. Solace appliances act as a big data “shock absorber” to smooth out the peaks without losing information.
- Replication & Disaster Recovery: Capturing data in a single location can expose your business to continuity problems. Solace’s appliances can efficiently replicate data streams across the WAN to provide failsafe delivery to remote datacenters.
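
Two of the ideas in the list above, affinity-based routing and de-duplication, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not a description of how Solace implements them: `shard_for` and `LastValueQueue` are hypothetical names, the hash-modulo scheme stands in for Solace's routing protocols, and the queue here is a simplified per-key variant of the last-value idea combined with sequence numbers.

```python
import hashlib


def shard_for(key, num_shards):
    """Map a key to one of num_shards nodes using a stable hash.

    The same key always maps to the same shard, so later updates keep
    affinity with the node that stored the first one.
    """
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


class LastValueQueue:
    """Keeps only the newest message per key (toy last-value queue)."""

    def __init__(self):
        self._latest = {}

    def put(self, key, seq, value):
        """Store value unless an equal or newer sequence number is held.

        Returns True if the message was accepted, False if it was dropped
        as a duplicate or stale update.
        """
        held = self._latest.get(key)
        if held is not None and held[0] >= seq:
            return False
        self._latest[key] = (seq, value)
        return True

    def get(self, key):
        """Return the latest value for key, or None if nothing is held."""
        held = self._latest.get(key)
        return None if held is None else held[1]


# One last-value queue per shard (3 shards is an arbitrary choice).
shards = [LastValueQueue() for _ in range(3)]

# The third update repeats sequence number 2 and should be dropped.
updates = [("sensor-42", 1, 20.0), ("sensor-42", 2, 20.5), ("sensor-42", 2, 20.5)]
results = []
for key, seq, value in updates:
    shard = shards[shard_for(key, len(shards))]
    results.append(shard.put(key, seq, value))
```

Note that a plain modulo over the shard count remaps most keys when the number of shards changes; a production system would need a scheme that preserves affinity across such changes.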