[PATCH]: buddy allocator: ext3 failed to alloc with __GFP_NOFAIL

Hello Andrew,

we had the following ext3 problems once during stress testing (on 2.6.8):
-----------------------------------------------------------------
journal_get_undo_access: No memory for committed data

ext3_free_blocks: aborting transaction: Out of memory in__ext3_journal_get_undo_access<2>EXT3-fs error (device hda7) inext3_free_blocks: Out of memory

Aborting journal on device hda7.
EXT3-fs error (device hda7) in ext3_ordered_commit_write: IO failure
ext3_abort called.
EXT3-fs abort (device hda7): ext3_journal_start: Detected aborted journal
Remounting filesystem read-only

ext3_free_blocks: aborting transaction: Journal has aborted in__ext3_journal_get_undo_access<2>EXT3-fs error (device hda7) inext3_free_blocks: Journal has abortedext3_reserve_inode_write: aborting transaction: Journal has aborted in__ext3_journal_get_write_access<2>EXT3-fs error (device hda7) inext3_reserve_inode_write: Journal has aborted

EXT3-fs error (device hda7) in ext3_truncate: Out of memory

ext3_reserve_inode_write: aborting transaction: Journal has aborted in__ext3_journal_get_write_access<2>EXT3-fs error (device hda7) inext3_reserve_inode_write: Journal has aborted

EXT3-fs error (device hda7) in ext3_orphan_del: Journal has aborted

ext3_reserve_inode_write: aborting transaction: Journal has aborted in__ext3_journal_get_write_access<2>EXT3-fs error (device hda7) inext3_reserve_inode_write: Journal has aborted

EXT3-fs error (device hda7) in ext3_delete_inode: Out of memory
__journal_remove_journal_head: freeing b_committed_data
..................
------------------------------------------------------------------

As it is seen from the messages journal_get_undo_access() failed toallocate some memory with jbd_kmalloc(), which in turn called kmalloc()with __GFP_NOFAIL flag.

How could it happen?

The only possible reason for this we suppose is a piece of code in__alloc_pages():

if ((p->flags & (PF_MEMALLOC | PF_MEMDIE)) && !in_interrupt()) {
        /* go through the zonelist yet again, ignoring mins */
        for (i = 0; zones[i] != NULL; i++) {
                struct zone *z = zones[i];

                page = buffered_rmqueue(z, order, gfp_mask);
                if (page) {
                        zone_statistics(zonelist, z);
                        goto got_pg;
                }
        }
        goto nopage;				<<<< HERE!!! FAIL...
}

So kswapd (which has PF_MEMALLOC flag) can fail to allocate memory evenwhen it allocates it with __GFP_NOFAIL flag.

Signed-Off-By: Pavel Emelianov <xemul@sw.ru>
Signed-Off-By: Denis Lunev <den@sw.ru>
Signed-Off-By: Kirill Korotaev <dev@sw.ru>

The attached patch should fix the problem (against 2.6.14).

Kirill

--- ./mm/page_alloc.c.mmreb	2005-10-28 04:02:08.000000000 +0400
+++ ./mm/page_alloc.c	2005-11-08 16:32:36.000000000 +0300
@@ -867,6 +867,7 @@ zone_reclaim_retry:
 
 	/* This allocation should allow future memory freeing. */
 
+rebalance:
 	if (((p->flags & PF_MEMALLOC) || unlikely(test_thread_flag(TIF_MEMDIE)))
 			&& !in_interrupt()) {
 		if (!(gfp_mask & __GFP_NOMEMALLOC)) {
@@ -879,14 +880,13 @@ zone_reclaim_retry:
 					goto got_pg;
 			}
 		}
-		goto nopage;
+		goto empty;
 	}
 
 	/* Atomic allocations - we can't balance anything */
 	if (!wait)
 		goto nopage;
 
-rebalance:
 	cond_resched();
 
 	/* We now go into synchronous reclaim */
@@ -946,6 +946,7 @@ rebalance:
 	 * In this implementation, __GFP_REPEAT means __GFP_NOFAIL for order
 	 * <= 3, but that may not be true in other implementations.
 	 */
+empty:
 	do_retry = 0;
 	if (!(gfp_mask & __GFP_NORETRY)) {
 		if ((order <= 3) || (gfp_mask & __GFP_REPEAT))

Follow-Ups:
- Re: [PATCH]: buddy allocator: ext3 failed to alloc with __GFP_NOFAIL
  - From: Kirill Korotaev <dev@sw.ru>

Prev by Date: Re: [PATCH 19/21] i386 Kprobes semaphore fix
Next by Date: Re: [PATCH] provide stricmp
Previous by thread: PROBLEM: b44 network driver performance
Next by thread: Re: [PATCH]: buddy allocator: ext3 failed to alloc with __GFP_NOFAIL
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]