linux - linux

	Commit message (Collapse)	Author	Age	Files	Lines
*	vfs: spread struct mount - mnt_has_parent	Al Viro	2012-01-04	3	-12/+13
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - do_umount/propagate_mount_busy	Al Viro	2012-01-04	3	-23/+23
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount mnt_set_mountpoint child argument	Al Viro	2012-01-04	3	-7/+7
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - clone_mnt/copy_tree argument	Al Viro	2012-01-04	3	-32/+35
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - shrink_submounts/select_submounts	Al Viro	2012-01-04	1	-14/+14
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - umount_tree argument	Al Viro	2012-01-04	3	-20/+20
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: the first spoils - mnt_hash moved	Al Viro	2012-01-04	3	-17/+18
\| \| \| \| \| \|	taken out of struct vfsmount into struct mount Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount to remaining users of ->mnt_hash	Al Viro	2012-01-04	1	-13/+13
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - clone_mnt/copy_tree result	Al Viro	2012-01-04	3	-26/+30
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - change_mnt_propagation/set_mnt_shared	Al Viro	2012-01-04	3	-14/+14
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - alloc_vfsmnt/free_vfsmnt/mnt_alloc_id/mnt_free_id	Al Viro	2012-01-04	1	-41/+40
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - tree_contains_unbindable	Al Viro	2012-01-04	1	-3/+3
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - attach_recursive_mnt	Al Viro	2012-01-04	1	-11/+14
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - mount group id handling	Al Viro	2012-01-04	3	-20/+20
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - commit_tree	Al Viro	2012-01-04	1	-9/+9
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - attach_mnt/detach_mnt	Al Viro	2012-01-04	1	-19/+22
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - namespace.c internal iterators	Al Viro	2012-01-04	1	-71/+74
\| \| \| \| \| \| \|	next_mnt() return value, first argument skip_mnt_tree() return value and argument Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - __propagate_umount() argument	Al Viro	2012-01-04	1	-7/+7
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: spread struct mount - __lookup_mnt() result	Al Viro	2012-01-04	5	-22/+31
\| \| \| \| \| \|	switch __lookup_mnt() to returning struct mount *; callers adjusted. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: start hiding vfsmount guts series	Al Viro	2012-01-04	2	-8/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Almost all fields of struct vfsmount are used only by core VFS (and a fairly small part of it, at that). The plan: embed struct vfsmount into struct mount, making the latter visible only to core parts of VFS. Then move fields from vfsmount to mount, eventually leaving only mnt_root/mnt_sb/mnt_flags in struct vfsmount. Filesystem code still gets pointers to struct vfsmount and remains unchanged; all such pointers go to struct vfsmount embedded into the instances of struct mount allocated by fs/namespace.c. When fs/namespace.c et.al. get a pointer to vfsmount, they turn it into pointer to mount (using container_of) and work with that. This is the first part of series; struct mount is introduced, allocation switched to using it. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: fix the rest of sget() races	Al Viro	2012-01-04	1	-7/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	unfortunately, just checking MS_BORN after having grabbed ->s_umount in sget() is not enough; places that pick superblock from a list and grab s_umount shared need the same check in addition to checking for ->s_root; otherwise three-way race between failing mount, sget() and such list-walker can leave us with list-walker coming second, when temporary active ref grabbed by sget() (to be dropped when sget() notices that original mount has failed by checking MS_BORN) has lead to deactivate_locked_super() from failing ->mount() not doing ->kill_sb() and just releasing ->s_umount. Once sget() gets through and notices that MS_BORN had never been set it will drop the active ref and fs will be shut down and kicked out of all lists, but it's too late for something like sync_supers(). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: new helper - vfs_ustat()	Al Viro	2012-01-04	3	-16/+15
\| \| \| \| \| \| \|	... and bury user_get_super()/statfs_by_dentry() - they are purely internal now. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: live vfsmounts never have NULL ->mnt_sb	Al Viro	2012-01-04	1	-1/+1
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: for usbfs, etc. internal vfsmounts ->mnt_sb->s_root == ->mnt_root	Al Viro	2012-01-04	3	-3/+3
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: pipe.c is really non-modular	Al Viro	2012-01-04	1	-7/+0
\| \| \| \| \| \| \|	... so no exitcalls there. Not much would work if pipe(2) would stop working, after all... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: fix the stupidity with i_dentry in inode destructors	Al Viro	2012-01-04	48	-50/+1
\| \| \| \| \| \| \| \| \| \|	Seeing that just about every destructor got that INIT_LIST_HEAD() copied into it, there is no point whatsoever keeping this INIT_LIST_HEAD in inode_init_once(); the cost of taking it into inode_init_always() will be negligible for pipes and sockets and negative for everything else. Not to mention the removal of boilerplate code from ->destroy_inode() instances... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: mnt_drop_write_file()	Al Viro	2012-01-04	21	-48/+54
\| \| \| \| \| \| \|	new helper (wrapper around mnt_drop_write()) to be used in pair with mnt_want_write_file(). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	constify seq_file stuff	Al Viro	2012-01-04	1	-5/+5
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: make do_kern_mount() static	Al Viro	2012-01-04	1	-2/+1
\| \| \| \| \| \|	the only user outside of fs/namespace.c has died Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: convert fs_supers to hlist	Al Viro	2012-01-04	2	-13/+14
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	make nfs_follow_remote_path() handle ERR_PTR() passed as root_mnt	Al Viro	2012-01-04	1	-9/+9
\| \| \| \| \| \|	... rather than duplicating that in callers Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: kill ->mnt_devname use in afs printks	Al Viro	2012-01-04	1	-2/+2
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	btrfs, nfs, apparmor: don't pull mnt_namespace.h for no reason...	Al Viro	2012-01-04	2	-2/+0
\| \| \| \| \| \| \| \|	it's not needed anymore; we used to, back when we had to do mount_subtree() by hand, complete with put_mnt_ns() in it. No more... Apparmor didn't need it since the __d_path() fix. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: dentry_reset_mounted() doesn't use vfsmount argument	Al Viro	2012-01-04	1	-3/+3
\| \| \| \| \| \|	lose it Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	unexport put_mnt_ns(), make create_mnt_ns() static outright	Al Viro	2012-01-04	1	-3/+1
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: add missing parens in pnode.h macros	Al Viro	2012-01-04	1	-5/+5
\| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: more mnt_parent cleanups	Al Viro	2012-01-04	4	-55/+29
\| \| \| \| \| \| \| \| \| \| \| \|	a) mount --move is checking that ->mnt_parent is non-NULL before looking if that parent happens to be shared; ->mnt_parent is never NULL and it's not even an misspelled !mnt_has_parent() b) pivot_root open-codes is_path_reachable(), poorly. c) so does path_is_under(), while we are at it. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: new internal helper: mnt_has_parent(mnt)	Al Viro	2012-01-04	5	-12/+18
\| \| \| \| \| \| \| \| \| \| \| \|	vfsmounts have ->mnt_parent pointing either to a different vfsmount or to itself; it's never NULL and termination condition in loops traversing the tree towards root is mnt == mnt->mnt_parent. At least one place (see the next patch) is confused about what's going on; let's add an explicit helper checking it right way and use it in all places where we need it. Not that there had been too many, but... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	vfs: kill pointless helpers in namespace.c	Al Viro	2012-01-04	1	-30/+5
\| \| \| \| \| \| \|	mnt_{inc,dec}_count() is not cleaner than doing the corresponding mnt_add_count() directly and mnt_set_count() is not used at all. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	new helpers: fh_{want,drop}_write()	Al Viro	2012-01-04	3	-19/+29
\| \| \| \| \| \| \|	A bunch of places in nfsd does mnt_{want,drop}_write on vfsmount of export of given fhandle. Switched to obvious inlined helpers... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	switch a bunch of places to mnt_want_write_file()	Al Viro	2012-01-04	16	-42/+42
\| \| \| \| \| \|	it's both faster (in case when file has been opened for write) and cleaner. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	trim fs/internal.h	Al Viro	2012-01-04	5	-28/+15
\| \| \| \| \| \| \|	some stuff in there can actually become static; some belongs to pnode.h as it's a private interface between namespace.c and pnode.c... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	pull manipulations of rpc_cred inside alloc_nfs_open_context()	Al Viro	2012-01-04	2	-32/+21
\| \| \| \| \| \| \| \| \|	No need to duplicate them in both callers; make it return ERR_PTR(-ENOMEM) on allocation failure instead of NULL and it'll be able to report rpc_lookup_cred() failures just fine. Callers are much happier that way... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
*	Merge branch 'for-linus' of ↵	Linus Torvalds	2011-12-30	1	-26/+3
\|\ \| \| \| \| \| \| \| \| \| \| \| \|	git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client: ceph: disable use of dcache for readdir etc.
\| *	ceph: disable use of dcache for readdir etc.	Sage Weil	2011-12-29	1	-26/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Ceph attempts to use the dcache to satisfy negative lookups and readdir when the entire directory contents are in cache. Disable this behavior until lingering bugs in this code are shaken out; we'll re-enable these hooks once things are fully stable. Signed-off-by: Sage Weil <sage@newdream.net>
* \|	Merge branch 'for-linus' of git://oss.sgi.com/xfs/xfs	Linus Torvalds	2011-12-30	3	-25/+43
\|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* 'for-linus' of git://oss.sgi.com/xfs/xfs: xfs: log all dirty inodes in xfs_fs_sync_fs xfs: log the inode in ->write_inode calls for kupdate
\| * \|	xfs: log all dirty inodes in xfs_fs_sync_fs	Christoph Hellwig	2011-12-23	3	-24/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since Linux 2.6.36 the writeback code has introduces various measures for live lock prevention during sync(). Unfortunately some of these are actively harmful for the XFS model, where the inode gets marked dirty for metadata from the data I/O handler. The older_than_this checks that are now more strictly enforced since writeback: avoid livelocking WB_SYNC_ALL writeback by only calling into __writeback_inodes_sb and thus only sampling the current cut off time once. But on a slow enough devices the previous asynchronous sync pass might not have fully completed yet, and thus XFS might mark metadata dirty only after that sampling of the cut off time for the blocking pass already happened. I have not myself reproduced this myself on a real system, but by introducing artificial delay into the XFS I/O completion workqueues it can be reproduced easily. Fix this by iterating over all XFS inodes in ->sync_fs and log all that are dirty. This might log inode that only got redirtied after the previous pass, but given how cheap delayed logging of inodes is it isn't a major concern for performance. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Dave Chinner <dchinner@redhat.com> Tested-by: Mark Tinguely <tinguely@sgi.com> Reviewed-by: Mark Tinguely <tinguely@sgi.com> Signed-off-by: Ben Myers <bpm@sgi.com>
\| * \|	xfs: log the inode in ->write_inode calls for kupdate	Christoph Hellwig	2011-12-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the writeback code writes back an inode because it has expired we currently use the non-blockin ->write_inode path. This means any inode that is pinned is skipped. With delayed logging and a workload that has very little log traffic otherwise it is very likely that an inode that gets constantly written to is always pinned, and thus we keep refusing to write it. The VM writeback code at that point redirties it and doesn't try to write it again for another 30 seconds. This means under certain scenarious time based metadata writeback never happens. Fix this by calling into xfs_log_inode for kupdate in addition to data integrity syncs, and thus transfer the inode to the log ASAP. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Dave Chinner <dchinner@redhat.com> Tested-by: Mark Tinguely <tinguely@sgi.com> Reviewed-by: Mark Tinguely <tinguely@sgi.com> Signed-off-by: Ben Myers <bpm@sgi.com>
* \| \|	procfs: do not confuse jiffies with cputime64_t	Andreas Schwab	2011-12-30	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commit 2a95ea6c0d129b4 ("procfs: do not overflow get_{idle,iowait}_time for nohz") did not take into account that one some architectures jiffies and cputime use different units. This causes get_idle_time() to return numbers in the wrong units, making the idle time fields in /proc/stat wrong. Instead of converting the usec value returned by get_cpu_{idle,iowait}_time_us to units of jiffies, use the new function usecs_to_cputime64 to convert it to the correct unit of cputime64_t. Signed-off-by: Andreas Schwab <schwab@linux-m68k.org> Acked-by: Michal Hocko <mhocko@suse.cz> Cc: Arnd Bergmann <arnd@arndb.de> Cc: "Artem S. Tashkinov" <t.artem@mailcity.com> Cc: Dave Jones <davej@redhat.com> Cc: Alexey Dobriyan <adobriyan@gmail.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: "Luck, Tony" <tony.luck@intel.com> Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org> Cc: Martin Schwidefsky <schwidefsky@de.ibm.com> Cc: Heiko Carstens <heiko.carstens@de.ibm.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
* \| \|	vfs: fix handling of lock allocation failure in lease-break case	Linus Torvalds	2011-12-26	1	-8/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bruce Fields notes that commit 778fc546f749 ("locks: fix tracking of inprogress lease breaks") introduced a possible error pointer dereference on failure to allocate memory. locks_conflict() will dereference the passed-in new lease lock structure that may be an error pointer. This means an open (without O_NONBLOCK set) on a file with a lease applied (generally only done when Samba or nfsd (with v4) is running) could crash if a kmalloc() fails. So instead of playing games with IS_ERROR() all over the place, just check the allocation failure early. That makes the code more straightforward, and avoids this possible bad pointer dereference. Based-on-patch-by: J. Bruce Fields <bfields@redhat.com> Cc: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>