> ISYNC_ON_SMP flushes all speculative reads currently in the queue - and
> is hence an smp_rmb_backwards() primitive [per my previous mail] - but
> does not affect writes - correct?
>
> If that's the case, what prevents a store from within the critical
> section moving up to right after the EIEIO_ON_SMP, but before the
> atomic-dec instructions? Do any of those instructions imply some
> barrier, perhaps? Are writes always ordered (like on x86 CPUs), so that
> the store before the bne is an effective write-barrier?
This all makes much more sense after reading PowerPC Book II, which was
written by people who explain this for a living. You can find it at:
http://www-128.ibm.com/developerworks/eserver/articles/archguide.html
While isync technically doesn't order stores, it does order instructions:
the isync cannot complete until the previous bne- has completed, and that
bne- depends on the previous stwcx. completing. So no stores are slipping
up past the atomic operation. For a better explanation you will have to
read the document yourself.
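To make that dependency chain concrete, here is a minimal, untested sketch
of the same acquire sequence written out as a standalone helper, with the
ordering argument spelled out in comments (the function name is made up
for illustration and is not part of the patch below):

static inline int atomic_dec_acquire_sketch(int *counter)
{
	int tmp;
	__asm__ __volatile__(
"1:	lwarx	%0,0,%1\n"	/* load-reserve the counter */
"	addic	%0,%0,-1\n"	/* decrement it */
"	stwcx.	%0,0,%1\n"	/* store conditionally; sets cr0 */
"	bne-	1b\n"		/* lost the reservation: retry */
"	isync\n"		/* cannot complete until the bne- resolves,
				 * and the bne- depends on the stwcx.,
				 * so no later load or store from the
				 * critical section is performed early */
	: "=&r" (tmp)
	: "r" (counter)
	: "cr0", "memory");
	return tmp;		/* the decremented value */
}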
Here is a first pass at a powerpc file for the fast paths, just as an
FYI/RFC. It is completely untested, but it compiles. (A rough sketch of
how the generic mutex code would call these helpers follows the patch.)
Signed-off-by: Joel Schopp <[email protected]>
Index: 2.6.15-mutex14/include/asm-powerpc/mutex.h
===================================================================
--- 2.6.15-mutex14.orig/include/asm-powerpc/mutex.h 2006-01-04 14:46:31.%N -0600
+++ 2.6.15-mutex14/include/asm-powerpc/mutex.h 2006-01-05 16:25:41.%N -0600
@@ -1,9 +1,83 @@
/*
- * Pull in the generic implementation for the mutex fastpath.
+ * include/asm-powerpc/mutex.h
*
- * TODO: implement optimized primitives instead, or leave the generic
- * implementation in place, or pick the atomic_xchg() based generic
- * implementation. (see asm-generic/mutex-xchg.h for details)
+ * PowerPC optimized mutex locking primitives
+ *
+ * Please look into asm-generic/mutex-xchg.h for a formal definition.
+ * Copyright (C) 2006 Joel Schopp <[email protected]>, IBM
*/
+#ifndef _ASM_MUTEX_H
+#define _ASM_MUTEX_H
+#define __mutex_fastpath_lock(count, fail_fn)\
+do{ \
+ long tmp; \
+ __asm__ __volatile__( \
+"1: lwarx %0,0,%1\n" \
+" addic %0,%0,-1\n" \
+" stwcx. %0,0,%1\n" \
+" bne- 1b\n" \
+" isync \n" \
+ : "=&r" (tmp) \
+ : "r" (&(count)->counter) \
+ : "cr0", "memory"); \
+ if (unlikely(tmp < 0)) \
+ fail_fn(count); \
+} while (0)
+
+#define __mutex_fastpath_unlock(count, fail_fn)\
+do{ \
+ long tmp; \
+ __asm__ __volatile__(SYNC_ON_SMP \
+"1: lwarx %0,0,%1\n" \
+" addic %0,%0,1\n" \
+" stwcx. %0,0,%1\n" \
+" bne- 1b\n" \
+ : "=&r" (tmp) \
+ : "r" (&(count)->counter) \
+ : "cr0", "memory"); \
+ if (unlikely(tmp <= 0)) \
+ fail_fn(count); \
+} while (0)
+
+
+static inline int
+__mutex_fastpath_trylock(atomic_t* count, int (*fail_fn)(atomic_t*))
+{
+ long tmp;
+ __asm__ __volatile__(
+"1: lwarx %0,0,%1\n"
+" cmpwi 0,%0,1\n"
+" bne- 2f\n"
+" stwcx. %2,0,%1\n"
+" bne- 1b\n"
+" isync\n"
+"2:"
+ : "=&r" (tmp)
+ : "r" (&(count)->counter), "r" (0)
+ : "cr0", "memory");
+
+ return tmp == 1;
+
+}
+
+#define __mutex_slowpath_needs_to_unlock() 1
-#include <asm-generic/mutex-dec.h>
+static inline int
+__mutex_fastpath_lock_retval(atomic_t* count, int (*fail_fn)(atomic_t *))
+{
+ long tmp;
+ __asm__ __volatile__(
+"1: lwarx %0,0,%1\n"
+" addic %0,%0,-1\n"
+" stwcx. %0,0,%1\n"
+" bne- 1b\n"
+" isync \n"
+ : "=&r" (tmp)
+ : "r" (&(count)->counter)
+ : "cr0", "memory");
+ if (unlikely(tmp < 0))
+ return fail_fn(count);
+ else
+ return 0;
+}
+#endif
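For reference, here is roughly how the generic mutex code is expected to
invoke these fastpath helpers. This is a simplified, untested sketch; the
example_* wrappers are made up for illustration, and the slowpath names
follow the generic code in kernel/mutex.c:

/* Sketch only: assumes the usual declarations from <linux/mutex.h>. */
void example_mutex_lock(struct mutex *lock)
{
	/* Atomically decrement lock->count; if it went negative,
	 * the mutex was already held and we take the slow path. */
	__mutex_fastpath_lock(&lock->count, __mutex_lock_slowpath);
}

void example_mutex_unlock(struct mutex *lock)
{
	/* Atomically increment lock->count; if it is still not positive,
	 * there are waiters to wake, so take the slow path. */
	__mutex_fastpath_unlock(&lock->count, __mutex_unlock_slowpath);
}

Note that the unlock fastpath issues its sync before the increment, the
intent being that all stores from the critical section are visible before
the mutex can be observed as free.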