Friday, May 31, 2013

NFS Mount Point disconnect Causes Instance to Hang on AIX

Recently we saw an issue with slow sqlplus login response time when we had a NFS file system mount disconnected due to a network issue in AIX environment. Instead of regular subsecond response, it took more than 5 minutes to complete the login when we had this issue.

This is an issue with the way in which the system call getcwd is implemented within AIX.
As long as the NFS mount point has at least one other parent directory besides the root directory,
this problem will not occur, regardless of whether the remote file system is reachable or not.

For example, if your current NFS mount point is called /oracle_depot, the fix would be to rename the mount point to something like /nfs/oracle_depot.

# unmount /oracle_depot
# mkdir /nfs
# mv /oracle_depot /nfs
# mount nfsmaster01:/oracle_share /oracle_depot/nfs
Be sure to make a similar configuration change within smit, so that it will survive a reboot.

No comments:

Post a Comment