Thursday, December 15, 2011

RAC Interview Questions

1.       What is RAC?

RAC stands for Real Application cluster. It is a clustering solution from Oracle Corporation that ensures high availability of databases by providing instance failover, media failover features.

2.       Mention the Oracle RAC software components:-

Oracle RAC is composed of two or more database instances. They are composed of Memory structures and background processes same as the single instance database. Oracle RAC instances use two processes GES(Global Enqueue Service), GCS(Global Cache Service) that enable cache fusion. Oracle RAC instances are composed of following background processes:

a.       ACMS—Atomic Controlfile to Memory Service (ACMS)

b.       GTX0-j—Global Transaction Process

c.       LMON—Global Enqueue Service Monitor

d.       LMD—Global Enqueue Service Daemon

e.       LMS—Global Cache Service Process

f.        LCK0—Instance Enqueue Process

g.       RMSn—Oracle RAC Management Processes (RMSn)

h.       RSMN—Remote Slave Monitor

3.       What is GRD?

GRD stands for Global Resource Directory. The GES and GCS maintain records of the statuses of each datafile and each cached block using global resource directory. This process is referred to as cache fusion and helps in data integrity.

4.       Give Details on Cache Fusion:-

Oracle RAC is composed of two or more instances. When a block of data is read from datafile by an instance within the cluster and another instance is in need of the same block, it is easy to get the block image from the instance which has the block in its SGA rather than reading from the disk. To enable inter instance communication Oracle RAC makes use of interconnects. The Global Enqueue Service (GES) monitors and Instance enqueue process manages the cache fusion

5.       Give Details on ACMS:-

ACMS stands for Atomic Controlfile Memory Service. In an Oracle RAC environment ACMS is an agent that ensures a distributed SGA memory update(ie)SGA updates are globally committed on success or globally aborted in event of a failure.

6.       Give details on GTX0-j :-

The process provides transparent support for XA global transactions in a RAC environment. The database auto tunes the number of these processes based on the workload of XA global transactions.

7.       Give details on LMON:-

This process monitors global enques and resources across the cluster and performs global enqueue recovery operations. This is called as Global Enqueue Service Monitor.

8.       Give details on LMD:-

This process is called as global enqueue service daemon. This process manages incoming remote resource requests within each instance.

9.       Give details on LMS:-

This process is called as Global Cache service process. This process maintains statuses of datafiles and each cached block by recording information in a Global Resource Directory (GRD).This process also controls the flow of messages to remote instances and manages global data block access and transmits block images between the buffer caches of different instances. This processing is a part of cache fusion feature.

10.   Give details on LCK0:-

This process is called as Instance enqueue process. This process manages non-cache fusion resource requests such as library and row cache requests.

11.   Give details on RMSn:-

This process is called as Oracle RAC management process. These processes perform manageability tasks for Oracle RAC. Tasks include creation of resources related Oracle RAC when new instances are added to the cluster.

12.   Give details on RSMN:-

This process is called as Remote Slave Monitor. This process manages background slave process creation and communication on remote instances. This is a background slave process. This process performs tasks on behalf of a co-coordinating process running in another instance.

13.   What components in RAC must reside in shared storage?

All datafiles, controlfiles, SPFIles, redo log files must reside on cluster-aware shred storage.

14.   What is the significance of using cluster-aware shared storage in an Oracle RAC environment?

All instances of an Oracle RAC can access all the datafiles, control files, SPFILE's, redologs files when these files are hosted out of cluster-aware shared storage which are group of shared disks.

15.   Give few examples for solutions that support cluster storage:-

ASM (automatic storage management),raw disk devices, network file system(NFS), OCFS2 and OCFS(Oracle Cluster Fie systems).

16.   What is an interconnect network?

An interconnect network is a private network that connects all of the servers in a cluster. The interconnect network uses a switch/multiple switches that only the nodes in the
cluster can access.

17.   How can we configure the cluster interconnect?

Configure User Datagram Protocol (UDP) on Gigabit Ethernet for cluster interconnects. On UNIX and Linux systems we use UDP and RDS (Reliable data socket) protocols to be used by Oracle Clusterware. Windows clusters use the TCP protocol.

18.   Can we use crossover cables with Oracle Clusterware interconnect?

No, crossover cables are not supported with Oracle Clusterware interconnects.

19.   What is the use of cluster interconnecting?

Cluster interconnect is used by the Cache fusion for inter instance communication.

The Oracle Notification Service (ONS) daemon is an daemon started by the CRS clusterware as part of the nodeapps. There is one ons daemon started per clustered node.

The Oracle Notification Service daemon receive a subset of published clusterware events via the local evmd and racgimon clusterware daemons and forward those events to application subscribers and to the local listeners. Users can access a RAC database using a client/server configuration or through one or more middle tiers, with or without connection pooling. Users can use oracle services feature to connect to database.

20.   What is the use of a service in Oracle RAC environment?

Applications should use the services feature to connect to the Oracle database. Services enable us to define rules and characteristics to control how users and applications connect to database instances.

21.   What are the characterizes controlled by Oracle services feature?

The characteristics include a unique name, workload balancing and failover options, and high availability characteristics.

22.   Which enable the load balancing of applications in RAC?

Oracle Net Services enable the load balancing of application connections across all of the instances in an Oracle RAC database.

23.   What is a virtual IP address or VIP?

A virtual IP address or VIP is an alternate IP address that the client connections use instead of the standard public IP address. To configure VIP address, we need to reserve a spare IP address for each node, and the IP addresses must use the same subnet as the public network.

24.   What is the use of VIP?

If a node fails, then the node's VIP address fails over to another node on which the VIP address can accept TCP connections but it cannot accept Oracle connections.

25.   Give situations under which VIP address failover happens:-

VIP addresses failover happens when the node on which the VIP address runs fails, all interfaces for the VIP address fails, all interfaces for the VIP address are disconnected from the network.

26.   What is the significance of VIP address failover?

When a VIP address failover happens, Clients that attempt to connect to the VIP address receive a rapid connection refused error .They don't have to wait for TCP connection timeout messages.

27.   Give the usage of srvctl

a.       srvctl start instance -d db_name -i "inst_name_list" [-o start_options]

b.       srvctl stop instance -d name -i "inst_name_list" [-o stop_options]

c.       srvctl stop instance -d orcl -i "orcl3,orcl4" -o immediate

d.       srvctl start database -d name [-o start_options]

e.       srvctl stop database -d name [-o stop_options]

f.        srvctl start database -d orcl -o mount

28.   What are the major RAC wait events?

In a RAC environment the buffer cache is global across all instances in the cluster and hence the processing differs. The most common wait events related to this are gc cr request and gc buffer busy.

a.       GC CR request: the time it takes to retrieve the data from the remote cache.  Reason: RAC Traffic Using Slow Connection or Inefficient queries (poorly tuned queries will increase the amount of data blocks requested by an Oracle session. The more blocks requested typically means the more often a block will need to be read from a remote instance via the interconnect.)

b.       GC BUFFER BUSY: It is the time the remote instance locally spends accessing the requested data block.

29.   How can I verify that RAC instances are running?

Issue the following query from any one node connecting through SQL*PLUS.

$connect sys/sys as sysdba

SQL>select * from V$ACTIVE_INSTANCES;

The query gives the instance number under INST_NUMBER column, host_: instance name under INST_NAME column.

30.   What is FAN?

Fast application Notification as it abbreviates to FAN relates to the events related to instances, services and nodes. This is a notification mechanism that Oracle RAC uses to notify other processes about the configuration and service level information that includes service status changes such as, UP or DOWN events. Applications can respond to FAN events and take immediate action.

31.   What is rolling upgrade?

It is a new ASM feature from Database 11g.ASM instances in Oracle database 11g release (from 11.1) can be upgraded or patched using rolling upgrade feature. This enables us to patch or upgrade ASM nodes in a clustered environment without affecting database availability. During a rolling upgrade we can maintain a functional cluster while one or more of the nodes in the cluster are running in different software versions.

32.   Mention the components of Oracle Clusterware.

Oracle clusterware is made up of components like voting disk and Oracle Cluster Registry (OCR)

33.   Name some Oracle clusterware tools and their uses?

a.       OIFCFG - allocating and de-allocating network interfaces

b.       OCRCONFIG - Command-line tool for managing Oracle Cluster Registry

c.       OCRDUMP - Identify the interconnect being used

d.       CVU - Cluster verification utility to get status of CRS resources

34.   What are the types of load-balancing and difference?

There are two types of connection load-balancing: server-side load balancing and client-side load balancing. Client-side balancing happens at client side where load balancing is done using listener. In case of server-side load balancing listener uses a load-balancing advisory to redirect connections to the instance providing best service.

35.   What are the 3 greatest benefits that RAC provide?

a.       Availability

b.       Scalability – Add CPU, disk, memory along with nodes

c.       Ability to user low cost commodity hardware.

36.   How do you backup OCR?

There is an automatic backup mechanism for OCR. The default location is: $ORA_CRS_HOME/cdata/"clustername"/

To display backups:

 # ocrconfig –showbackup

To restore a backup:

# ocrconfig –restore

With Oracle RAC 10g Release 2 or later, you can also use the export command:

# ocrconfig -export -s online, and use -import option to restore the contents back.

With Oracle RAC 11g Release 1, you can do a manually backup of the OCR with the command:
# ocrconfig -manualbackup

37.   How do you backup voting disk?

# dd if=voting_disk_name of=backup_file_name

38.   How do I identify the voting disk location?

# crsctl query css votedisk

39.   How do I identify the OCR file location?

Check /var/opt/oracle/ocr.loc or /etc/ocr.loc (depends upon platform)

Or
# ocrcheck

40.   What is the purpose of the ONS deamon?

The Oracle Notification Service (ONS) daemon is an daemon started by the CRS clusterware as part of the nodeapps. There is one ons daemon started per clustered node.

The Oracle Notification Service daemon receives a subset of published clusterware events via the local evmd and racgimon clusterware daemons and forward those events to application subscribers and to the local listeners.

41.   What is Oracle Clusterware?

It provides the basic clustering support at OS level and enables oracle software to run in clustering more. It support up to 64 nodes. Oracle Clusterware can runs as a standalone cluster service or with a vendor supplied clusterware such as SunCluster or TrueCluster

42.   What is OCR?

Is the central repository for the CRS and keeps the details of the services and status of the resources. It should be storage and should be accessible to all nodes in the clusters simultaneously. It is the integral component of RAC

43.   What is voting disk?

It is used as a central reference for all the nodes and keeps the heartbeat information between nodes. It is a shared disk that will be accessed by all the members’ nodes in the cluster during the operations. It also called ‘Quorum Device”

44.   Steps to Install RAC

a.       OS configuration for the clusterware and RAC software, each server has to have Public IP and Private IP.

                                                              i.      Configuring network files /etc/hosts

                                                            ii.      Setup groups and users

                                                          iii.      Configure Kernel parameters

                                                           iv.      Configure the hangcheck-timer module (monitor linux kernel for extended operating system hangs)

b.       Setup the shared storage

                                                              i.      Configure raw devices /etc/systemconfig/rawdevices

c.       Establish user equivalence between nodes

d.        

                                                              i.       

45.    

a.        

 

Component

Description

Share Storage

SGA

Each instance has its own SGA

N/A

Background process

Each instance has its own BP

N/A

Data files

Shared by all instances, so must be placed in shared storage.

YES

Control files

Shared by all instances, so must be placed in shared storage.

YES

Online Redologs

Only one instance can write, but others instances can read during the recovery and archiving. If an instance is shutdown log switches by others instances can force the idle instance redologs to be archived.

YES

Archived Redologs

Private to the instance, but others instances will need access to all required archive logs during media recovery

NO

Flash Recovery Logs

Shared by all instances, so must be placed and shared storage

YES

Alert log and Trace

Private to each instance, other instances never read/write those files

NO

ORACLE_HOME

Same as single instance plus can be placed on shared storage file system, allowing a common ORACLE_HOME for all instances in an RAC environment.

YES/NO

 

No comments:

Post a Comment