CS791: 11/16 Lecture Notes

2 papers today: (3 if we're lucky)

Slide 1

Difficulties in changing from a unicase to a multicast paradigm for data transport

Slide 2

Defining Reliability for the purpose of Multicast:

Slide 3

SRM (Scalable Reliable Multicast): Key ideas

Slide 4

Objective: "minimal" definition of reliable multicast

Slide 5

Weak, "eventual" reliability

Feedback control (interesting part for us) is NACK-based

Slide 6

Algorithm for loss recovery (essential aspects of paper):

Slide 7

Simulation / measurement : loss analysis


Question: Gabe: "sltn to repair latency - what if we pair receivers?"
Answer: Problems:
  1. how do you set up the pairs? How do you make them loss-disjoint? This maes them distant - higher latencies
  2. what if your pair quits the session - keepalive, ping, etc pair-changing, etc
  3. what if you both lose anyway


Question: Khaled: "what keeps C and T from re-transmitting after B?"
Answer: B's re-xmission also reaches C and T (does it? scoped? what size is the scope?) and they suppress.
What about out-of-order? Hmm - this makes it trickier.
This paper: "really nice ideas, seems to work well in practice, but all the issues haven't been solved"

Slide 8

Scalability issues:

Slide 9

Local Recovery (next paper will use this extensively)


Question: Khaled: "are we using app-level framing?"
Answer: yes - assumption about drawops and idempotency - drawops fit into individual packets (generally)
problem: what if an op spans multiple packets? have to send/request multiple packets. in-other-words: application-level objects are "named" so they can be requested. logically = "presentation layer" (if you believe in such a thing)
point/issue : naming : sequence numbers may not be all that useful, whereas ADU (application data unit) may convey the needed (by application) information
assumption: non-conflicting namespaces (global-uique-host-IP:host-wide-unique seq)

Slide 10

PPV: A Response to SRM.

Problems w/ SRM:

Better job at addressing these 3 probs

Slide 11

Error Correction: Goals

Slide 12

Error Correction: Algorithms (Use picture fm paper, f.e. fig 4)

Why this approach is difficult: it requires routers to be smarter/do more.

Simulations in this paper = "good start at analyzing this" = powerful mechanism for local recovery

Clarification: "turning point" = router "above" a replier in distribution tree

Question: "Can a 'turning point' also be a replier?"
Answer: yes, but that would SEVERELY/MAJORLY change the router's role (as opposed to just a BIG change in the router's role)
Only a replier's NACK gets forwarded upstream - so router has to know who its immediate replier is

Slide XXX

Pros/Cons:

Slide XXX

That's it - next week:

Adam Bradley (artdodge@cs.bu.edu)