1. 22 Mar, 2012 19 commits
  2. 31 Oct, 2011 1 commit
  3. 25 Oct, 2011 3 commits
  4. 16 Sep, 2011 1 commit
  5. 26 Jul, 2011 1 commit
  6. 19 May, 2011 5 commits
  7. 03 May, 2011 1 commit
  8. 04 Mar, 2011 3 commits
    • libceph: fix msgr standby handling · e00de341
      Sage Weil authored
      
      The standby logic used to be pretty dependent on the work requeueing
      behavior that changed when we switched to WQ_NON_REENTRANT.  It was also
      very fragile.
      
      Restructure things so that:
       - We clear WRITE_PENDING when we set STANDBY.  This ensures we will
         requeue work when we wake up later.
       - con_work backs off if STANDBY is set.  There is nothing to do if we are
         in standby.
       - clear_standby() helper is called by both con_send() and con_keepalive(),
         the two actions that can wake us up again.  Move the connect_seq++
         logic here.
      Signed-off-by: Sage Weil <sage@newdream.net>
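The restructured standby rules can be sketched in userspace C. This is a minimal model, not the kernel code: the `struct conn` layout, the bit values, the `queued` counter and `enter_standby()` are assumptions made for illustration; only the flag and function names come from the commit message.

```c
#include <stdbool.h>

/* Flag names follow the commit message; the bit values are illustrative. */
enum {
    WRITE_PENDING     = 1 << 0,
    STANDBY           = 1 << 1,
    KEEPALIVE_PENDING = 1 << 2,
};

/* Hypothetical, heavily reduced stand-in for the connection state. */
struct conn {
    unsigned int flags;
    unsigned int connect_seq;
    unsigned int queued;      /* how many times work was queued */
};

/* Going idle: set STANDBY and clear WRITE_PENDING, so the next wake-up
 * sees no pending write and requeues work. */
static void enter_standby(struct conn *c)
{
    c->flags |= STANDBY;
    c->flags &= ~(unsigned int)WRITE_PENDING;
}

/* Shared wake-up helper; the connect_seq bump lives here. */
static void clear_standby(struct conn *c)
{
    if (c->flags & STANDBY) {
        c->flags &= ~(unsigned int)STANDBY;
        c->connect_seq++;
    }
}

/* Mark a write as pending and queue work unless one is already pending. */
static void mark_write_pending(struct conn *c)
{
    if (!(c->flags & WRITE_PENDING)) {
        c->flags |= WRITE_PENDING;
        c->queued++;
    }
}

static void con_send(struct conn *c)
{
    clear_standby(c);
    mark_write_pending(c);
}

static void con_keepalive(struct conn *c)
{
    clear_standby(c);
    c->flags |= KEEPALIVE_PENDING;
    mark_write_pending(c);
}

/* The worker backs off while in standby; returns true if it did work. */
static bool con_work(struct conn *c)
{
    if (c->flags & STANDBY)
        return false;
    return true;
}
```

In this model a connection that entered standby always gets work requeued by the first `con_send()` or `con_keepalive()`, because `WRITE_PENDING` was cleared on the way in.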
    • libceph: fix msgr keepalive flag · e76661d0
      Sage Weil authored
      
      There was some broken keepalive code using a dead variable.  Shift to using
      the proper bit flag.
      Signed-off-by: Sage Weil <sage@newdream.net>
    • libceph: fix msgr backoff · 60bf8bf8
      Sage Weil authored
      
      With commit f363e45f we replaced a bunch of hacky workqueue mutual
      exclusion logic with the WQ_NON_REENTRANT flag.  One piece of fallout is
      that the exponential backoff breaks in certain cases:
      
       * con_work attempts to connect.
       * we get an immediate failure, and the socket state change handler queues
         immediate work.
       * con_work calls con_fault, we decide to back off, but can't queue delayed
         work.
      
      In this case, we add a BACKOFF bit to make con_work reschedule delayed work
      next time it runs (which should be immediately).
      Signed-off-by: Sage Weil <sage@newdream.net>
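The failure sequence and the BACKOFF fix can be walked through with a small userspace model. Everything except the names `con_work`, `con_fault` and `BACKOFF` is an assumption for illustration: the `struct conn` fields, `queue_delayed()`, and the delay constants are hypothetical, and the real workqueue is reduced to a single "work already pending" flag.

```c
#include <stdbool.h>

enum { BACKOFF = 1 << 0 };       /* bit value is illustrative */

#define BASE_DELAY 1UL           /* hypothetical delay units */
#define MAX_DELAY  64UL

/* Hypothetical reduced connection state. */
struct conn {
    unsigned int  flags;
    unsigned long delay;          /* current backoff delay */
    bool          work_pending;   /* work item already sits in the queue */
    unsigned long pending_delay;  /* delay the pending work was queued with */
};

/* Like a workqueue, refuse to queue a work item that is already pending. */
static bool queue_delayed(struct conn *c, unsigned long delay)
{
    if (c->work_pending)
        return false;
    c->work_pending = true;
    c->pending_delay = delay;
    return true;
}

/* On a fault, grow the delay; if delayed work cannot be queued because
 * immediate work is already pending, remember that with BACKOFF. */
static void con_fault(struct conn *c)
{
    if (c->delay == 0)
        c->delay = BASE_DELAY;
    else if (c->delay < MAX_DELAY)
        c->delay *= 2;

    if (!queue_delayed(c, c->delay))
        c->flags |= BACKOFF;
}

/* The worker honours BACKOFF by rescheduling itself with the delay
 * instead of doing real work; returns true if it did real work. */
static bool con_work(struct conn *c)
{
    c->work_pending = false;      /* we are running now */
    if (c->flags & BACKOFF) {
        c->flags &= ~(unsigned int)BACKOFF;
        queue_delayed(c, c->delay);
        return false;
    }
    return true;
}
```

Setting `work_pending` before calling `con_fault()` reproduces the case from the list above: the delayed queueing fails, BACKOFF is set, and the next `con_work()` run converts it into properly delayed work.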
  9. 03 Mar, 2011 1 commit
    • libceph: retry after authorization failure · 692d20f5
      Sage Weil authored
      
      If we mark the connection CLOSED we will give up trying to reconnect to
      this server instance.  That is appropriate for things like a protocol
      version mismatch that won't change until the server is restarted, at which
      point we'll get a new addr and reconnect.  An authorization failure like
      this is probably due to the server not properly rotating its secret keys,
      however, and should be treated as transient so that the normal backoff and
      retry behavior kicks in.
      Signed-off-by: Sage Weil <sage@newdream.net>
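The distinction between permanent and transient connect failures might be expressed roughly as follows. The enum names and `handle_connect_failure()` are hypothetical and do not mirror the kernel's actual tag constants; the sketch only encodes the policy described in the message.

```c
/* Hypothetical classification of connect-time rejections. */
enum connect_reply {
    REPLY_BAD_PROTOCOL_VERSION,  /* will not change until the server restarts */
    REPLY_BAD_AUTHORIZER,        /* likely stale keys on the server: transient */
};

enum next_step {
    MARK_CLOSED,       /* give up on this server instance */
    FAULT_AND_RETRY,   /* normal backoff and retry */
};

static enum next_step handle_connect_failure(enum connect_reply reply)
{
    switch (reply) {
    case REPLY_BAD_PROTOCOL_VERSION:
        return MARK_CLOSED;
    case REPLY_BAD_AUTHORIZER:
        return FAULT_AND_RETRY;
    }
    return FAULT_AND_RETRY;  /* unknown replies are treated as transient here */
}
```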
  10. 25 Jan, 2011 2 commits
    • libceph: fix socket write error handling · 42961d23
      Sage Weil authored
      
      Pass errors from writing to the socket up the stack.  If we get -EAGAIN,
      return 0 from the helper to simplify the callers' checks.
      Signed-off-by: Sage Weil <sage@newdream.net>
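A userspace analogue of this convention, using the POSIX socket API rather than the kernel's, could look like the sketch below. `send_nonblock()` is a hypothetical helper name; `MSG_DONTWAIT` and `MSG_NOSIGNAL` are assumed to be available, as they are on Linux.

```c
#include <errno.h>
#include <sys/types.h>
#include <sys/socket.h>

/* Returns the number of bytes written, 0 if the socket would block,
 * or a negative errno value for a real error. */
static int send_nonblock(int fd, const void *buf, size_t len)
{
    ssize_t ret = send(fd, buf, len, MSG_DONTWAIT | MSG_NOSIGNAL);

    if (ret < 0) {
        if (errno == EAGAIN || errno == EWOULDBLOCK)
            return 0;          /* not an error: try again later */
        return -errno;         /* pass everything else up the stack */
    }
    return (int)ret;
}
```

With this shape a caller only has to test for a negative return value to detect a fault; "no progress right now" and "some bytes written" are both non-negative.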
    • libceph: fix socket read error handling · 98bdb0aa
      Sage Weil authored
      
      If we get EAGAIN when trying to read from the socket, it is not an error.
      Return 0 from the helper in this case to simplify the error handling cases
      in the caller (indirectly, try_read).
      
      Fix try_read to pass any error to its caller (con_work) instead of almost
      always returning 0.  This lets us respond to things like socket
      disconnects.
      Signed-off-by: Sage Weil <sage@newdream.net>
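The error propagation described here can be illustrated with a reduced read loop. Only the name `try_read` comes from the commit message; the `read_fn` callback type, the scripted reader, and the `total` accumulator are assumptions that make the sketch testable without a socket.

```c
#include <errno.h>

/* Hypothetical read helper contract: >0 bytes read, 0 if the read would
 * block (EAGAIN already folded into 0), negative errno on a real error. */
typedef int (*read_fn)(void *ctx);

/* Scripted reader used for illustration: replays canned return values. */
struct script {
    const int *results;
    int pos;
};

static int scripted_read(void *ctx)
{
    struct script *s = ctx;
    return s->results[s->pos++];
}

/* Keep reading until the helper reports "would block"; any real error is
 * returned to the caller instead of being swallowed. */
static int try_read(read_fn rd, void *ctx, int *total)
{
    for (;;) {
        int ret = rd(ctx);

        if (ret < 0)
            return ret;    /* e.g. -ECONNRESET reaches the con_work level */
        if (ret == 0)
            return 0;      /* nothing more to read for now */
        *total += ret;
    }
}
```

A caller in the role of con_work can then treat a negative return from `try_read()` as a connection fault, which is what allows it to react to a disconnect.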
  11. 12 Jan, 2011 1 commit
    • net/ceph: make ceph_msgr_wq non-reentrant · f363e45f
      Tejun Heo authored
      
      The ceph messenger code performs a rather complex dance around the
      multithreaded workqueue to make sure the same work item isn't executed
      concurrently on different CPUs.  This restriction can be provided by
      the workqueue itself with WQ_NON_REENTRANT.
      
      Make ceph_msgr_wq a non-reentrant workqueue with the default concurrency
      level and remove the QUEUED/BUSY logic.
      
      * This removes backoff handling in con_work() but it couldn't reliably
        block execution of con_work() to begin with - queue_con() can be
        called after the work started but before BUSY is set.  It seems that
        it was an optimization for a rather cold path and can be safely
        removed.
      
      * The number of concurrent work items is bounded by the number of
        connections, and connections are independent of each other.  With
        the default concurrency level, different connections will be
        executed independently.
      Signed-off-by: Tejun Heo <tj@kernel.org>
      Cc: Sage Weil <sage@newdream.net>
      Cc: ceph-devel@vger.kernel.org
      Signed-off-by: Sage Weil <sage@newdream.net>
  12. 14 Dec, 2010 1 commit
  13. 09 Nov, 2010 1 commit
    • ceph: explicitly specify page alignment in network messages · c5c6b19d
      Sage Weil authored
      
      The alignment used for reading data into or out of pages used to be taken
      from the data_off field in the message header.  This only worked as long
      as the page alignment matched the object offset, breaking direct io to
      non-page aligned offsets.
      
      Instead, explicitly specify the page alignment next to the page vector
      in the ceph_msg struct, and use that instead of the message header (which
      probably shouldn't be trusted).  The alloc_msg callback is responsible for
      filling in this field properly when it sets up the page vector.
      Signed-off-by: Sage Weil <sage@newdream.net>
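The arithmetic behind an explicit page alignment can be sketched as follows. The helper names and the 4 KiB page size are assumptions for illustration; the point is that the alignment is derived from where the data sits in the page vector, and that a non-zero alignment changes how many pages a transfer touches.

```c
#include <stdint.h>

#define PAGE_SHIFT 12                  /* assumes 4 KiB pages */
#define PAGE_SIZE  (1UL << PAGE_SHIFT)

/* Offset of the first data byte within its page. */
static unsigned int page_alignment(uint64_t offset)
{
    return (unsigned int)(offset & (PAGE_SIZE - 1));
}

/* Number of pages touched by len bytes that start align bytes into the
 * first page. */
static unsigned int pages_spanned(unsigned int align, uint64_t len)
{
    return (unsigned int)((align + len + PAGE_SIZE - 1) >> PAGE_SHIFT);
}
```

For example, 4096 bytes starting at alignment 0x200 span two pages, whereas the same length at alignment 0 fits in one; reading the alignment from an unrelated header field would get this wrong for direct I/O to unaligned offsets.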