1. What’s New in Transaction Management
Not much has really changed in SQL Server
2008 related to transactions, transaction logging, and transaction
management. About the only real change is the removal of the WITH TRUNCATE_ONLY and WITH NO_LOG options from the BACKUP LOG
command. These options are no longer available in SQL Server 2008 to
prune the transaction log. The alternative is to switch the database to
simple recovery model.
2. What Is a Transaction?
A transaction is one
or more SQL statements that must be completed as a whole or, in other
words, as a single logical unit of work. Transactions provide a way of
collecting and associating multiple actions into a single all-or-nothing
multiple-operation action. All operations within the transaction must
be fully completed or not performed at all.
Consider a bank transaction in which you move $1,000 from your checking account to your savings account. This transaction is, in fact, two
operations: a decrement of your checking account and an increment of
your savings account. Consider the impact on your finances if the bank’s
server went down after it completed the first step and never got to the
second! When the two operations are combined, as a transaction, they
either both succeed or both fail as a single, complete unit of work.
A transaction is a logical unit of work that has four special characteristics, known as the ACID properties:
Atomicity— Associated modifications are an all-or-nothing proposition; either all are done or none are done.
Consistency—
After a transaction finishes, all data is in the state it should be in,
all internal structures are correct, and everything accurately reflects
the transaction that has occurred.
Isolation— One transaction cannot interfere with the processes of another transaction.
Durability— After the transaction has finished, all changes made are permanent.
The responsibility for enforcing the ACID
properties of a transaction is split between T-SQL developers and SQL
Server. The developers are responsible for ensuring that the
modifications are correctly collected together and that the data is
going to be left in a consistent state that corresponds with the actions
being taken. SQL Server ensures that the transaction is isolated and
durable, undertakes the atomicity requested, and ensures the consistency
of the final data structures. The transaction log of each database
provides the durability for the transaction.
3. How SQL Server Manages Transactions
SQL Server uses the database’s transaction log to
record the modifications that occur within the database. Each log record
is labeled with a unique log sequence number (LSN), and all log entries
that are part of the same transaction are linked together so that they
can be easily located if the transaction needs to be undone or redone.
The primary responsibility of logging is to ensure transaction
durability—either ensuring that the completed changes make it to the
physical database files or ensuring that any unfinished transactions are
rolled back in the event of an error or a server failure.
What is logged? Obviously, the start and end of a
transaction are logged, but SQL Server also logs the actual data
modification, page allocations and de-allocations, and changes to
indexes. SQL Server keeps track of a number of pieces of information,
all with the aim of ensuring the ACID properties of the transaction.
After a transaction has been committed, it cannot be
rolled back. The only way to undo a committed transaction is to write
another transaction to reverse the changes made. A transaction can be
rolled back before it is committed, however.
SQL Server provides transaction management for all users, using the following components:
Transaction-control statements to define the logical units of work
A write-ahead transaction log
An automatic recovery process
Data-locking mechanisms to ensure consistency and transaction isolation
4. Defining Transactions
You can carry out transaction processing with Microsoft SQL Server in three ways:
AutoCommit—
Every T-SQL statement is its own transaction and automatically commits
when it finishes. This is the default mode in which SQL Server operates.
Explicit— This approach provides programmatic control of the transaction, using the BEGIN TRAN and COMMIT/ROLLBACK TRAN/WORK commands.
Implicit—
In this mode, when you issue certain SQL commands, SQL Server
automatically starts a transaction. You must finish the transaction by
explicitly issuing the COMMIT/ROLLBACK TRAN/WORK commands.
Each of these methods is discussed in the following sections.
Note
The terms for explicit and implicit transactions can
be somewhat confusing. The way to keep them straight is to think of how a
multistatement transaction is initiated, not how it is completed.
AutoCommit transactions are in a separate category because they are both
implicitly started and committed.
Implicit and explicit transactions have to be
explicitly ended, but explicit transactions must also be explicitly
started with the BEGIN TRAN statement, whereas no BEGIN TRAN is necessary to start a multistatement transaction when in implicit transaction mode.
AutoCommit Transactions
AutoCommit is the default transaction mode for SQL
Server. Each individual T-SQL command automatically commits or rolls
back its work at the end of its execution. Each SQL statement is
considered to be its own transaction, with begin and end control points
implied. Following is an example:
[implied begin transaction]
UPDATE account
SET balance = balance + 1000
WHERE account_no = "123456789"
[implied commit or rollback transaction]
If
an error is present in the execution of the statement, the action is
undone (that is, rolled back); if no errors occur, the action is
completed, and the changes are saved.
Assume that it is written as
follows in T-SQL:
declare @checking_account char(10),
@savings_account char(10)
select @checking_account = '0003456321',
@savings_account = '0003456322'
update account
set balance = balance - $1000
where account_number = @checking_account
update savings_account
set balance = balance + $1000
where account_number = @savings_account
What would happen if an error occurred in updating
the savings account? With AutoCommit, each statement is implicitly
committed after it completes successfully, so the update for the
checking account has already been committed. You would have no way of
rolling it back except to write another separate update to add the
$1,000 back to the account. If the system crashed during the updates,
how would you know which updates, if any, completed, and whether you
need to undo any of the changes because the subsequent commands were not
executed? You would need some way to group the two commands together as
a single logical unit of work so they can complete or fail as a whole.
SQL Server provides transaction control statements that allow you to
explicitly create multistatement user-defined transactions.
Explicit User-Defined Transactions
To have complete control of a transaction and define
logical units of work that consist of multiple data modifications, you
need to write explicit user-defined transactions. Any SQL Server user
can make use of the transaction control statements; no special
privileges are required.
To start a multistatement transaction, use the BEGIN TRAN command, which optionally takes a transaction name:
BEGIN TRAN[SACTION] [transaction_name [WITH MARK ['description']]]
The transaction name is essentially meaningless as
far as transaction management is concerned, and if transactions are
nested , the name is useful
only for the outermost BEGIN TRAN statement. Rolling back to
any other name, besides a savepoint name (savepoints are covered in the
next section), generates an error message similar to the following error
message and does not roll back the transaction:
Msg 6401, Level 16, State 1, Line 5
Cannot roll back t2. No transaction or savepoint of that name was found.
Naming transactions is really useful only when you use the WITH MARK option. If the WITH MARK option is specified, a transaction name must be specified. WITH MARK
allows for restoring a transaction log backup to a named mark in the
transaction log. (For more information on restoring database and log
backups. This option allows you to restore a
database to a known state or to recover a set of related databases to a
consistent state. However, you need to be aware that BEGIN TRAN records are written to the log only if an actual data modification occurs within the transaction.
You complete an explicit transaction by issuing either a COMMIT TRAN or COMMIT [WORK] statement, and you can undo an explicit transaction by using either ROLLBACK TRAN or ROLLBACK [WORK]. The syntax of these commands is as follows:
COMMIT [TRAN[SACTION] [transaction_name]] | [WORK]
ROLLBACK [TRAN[SACTION] [transaction_name | savepointname]] | [WORK]
The COMMIT statement marks the successful conclusion of a transaction. This statement can be coded as COMMIT, COMMIT WORK, or COMMIT TRAN. The only difference is that the first two versions are SQL-92 ANSI compliant.
The ROLLBACK statement unconditionally undoes all work done within the transaction. This statement can also be coded as ROLLBACK, ROLLBACK WORK, or ROLLBACK TRAN. The first two commands are ANSI-92 SQL compliant and do not accept user-defined transaction names. ROLLBACK TRAN is required if you want to roll back to a savepoint within a transaction.
The following example shows how you could code the previously mentioned banking example as a single transaction in SQL Server:
declare @checking_account char(10),
@savings_account char(10)
select @checking_account = '0003456321',
@savings_account = '0003456322'
begin tran
update account
set balance = balance - $1000
where account_number = @checking_account
if @@error != 0
begin
rollback tran
return
end
update savings_account
set balance = balance + $1000
where account_number = @savings_account
if @@error != 0
begin
rollback tran
return
end
commit tran
Certain
commands cannot be specified within a user-defined transaction,
primarily because they cannot be effectively rolled back in the event of
a failure. In most cases, because of their long-running nature, you
would not want them to be specified within a transaction anyway.
Following are the commands you cannot specify in a user-defined
transaction:
ALTER DATABASE
ALTER FULLTEXT CATALOG
ALTER FULLTEXT INDEX
BACKUP DATABASE
BACKUP LOG
CREATE DATABASE
CREATE FULLTEXT CATALOG
CREATE FULLTEXT INDEX
DROP DATABASE
DROP FULLTEXT CATALOG
DROP FULLTEXT INDEX
RESTORE DATABASE
RECONFIGURE
RESTORE LOG
UPDATE STATISTICS
Savepoints
A savepoint allows you to set a marker in a
transaction that you can roll back to undo a portion of the transaction
but commit the remainder of the transaction. The syntax is as follows:
SAVE TRAN[SACTION] savepointname
Savepoints are not ANSI-SQL 92 compliant, so you must
use the SQL Server–specific transaction management commands that allow
you to specify a named point within the transaction and then recover
back to it.
The following code illustrates the differences between the two types of syntax when using the SAVE TRAN command:
SQL-92 Syntax | SQL Server–Specific Syntax |
---|
BEGIN TRAN mywork
UPDATE table1...
SAVE TRAN savepoint1
INSERT INTO table2...
DELETE table3...
IF @@error = -1
ROLLBACK WORK
COMMIT WORK
| BEGIN TRAN mywork
UPDATE table1...
SAVE TRAN savepoint1
INSERT INTO table2...
DELETE table3...
IF @@error = -1
ROLLBACK TRAN savepoint1
COMMIT TRAN
|
Note the difference between the SQL-92 syntax on the
left and the SQL Server–specific syntax on the right. In the SQL-92
syntax, when you reach the ROLLBACK WORK command, the entire transaction is undone rather than undoing only to the point marked by the savepoint. You have to use the SQL Server–specific ROLLBACK TRAN
command and specify the savepoint name to roll back the work to the
savepoint and still be able to subsequently roll back or commit the rest
of the transaction.
Nested Transactions
As a rule, you can’t have more than one active
transaction per user session within SQL Server. However, suppose you
have a SQL batch that issues a BEGIN TRAN statement and then subsequently invokes a stored procedure, which also issues a BEGIN TRAN statement. Because you can have only one transaction active, what does the BEGIN TRAN inside the stored procedure accomplish? In SQL Server, this leads to an interesting anomaly referred to as nested transactions.
To determine whether transactions are open and how deep they are nested within a connection, you can use the global function @@trancount. If no transaction is active, the transaction nesting level is 0.
As a transaction is initiated, the transaction nesting level is
incremented; as a transaction completes, the transaction nesting is
decremented. The overall transaction remains open and can be entirely
rolled back until the transaction nesting level returns to 0.
You can use the @@trancount function to
monitor the current status of a transaction. For example, what would SQL
Server do when encountering the following transaction (which produces
an error because of the reference constraint on the titles table)?
use BIGPUBS2008
go
BEGIN TRAN
DELETE FROM publishers
WHERE pub_id = '0736'
go
Msg 547, Level 16, State 0, Line 2
The DELETE statement conflicted with the REFERENCE constraint
"FK__pub_info__pub_id__2BDE8E15". The conflict occurred in database
"bigpubs2008", table "dbo.pub_info", column 'pub_id'.
The statement has been terminated.
Is the transaction still active? You can find out by using the @@trancount function:
select @@trancount
go
-----------
1
In this case, @@trancount returns a value of
1, which indicates that the transaction is still open and in progress.
This means that you can still issue commands within the transaction and
commit the changes, or you can roll back the transaction. Also, if you
were to log out of the user session from SQL Server before the
transaction nesting level reached 0, SQL Server would automatically roll
back the transaction.
Although nothing prevents you from coding a BEGIN TRAN within another BEGIN TRAN,
doing so has no real benefit, even though such cases might occur.
However, if you nest transactions in this manner, you must execute a COMMIT statement for each BEGIN TRAN statement issued. The reason is that SQL Server modifies the @@trancount with each transaction statement and considers the transaction finished only when the transaction nesting level returns to 0. Table 1 shows the effects that transaction control statements have on @@trancount.
Table 1. Transaction Statements’ Effects on @@trancount
Statement | Effect on @@trancount |
---|
BEGIN TRAN | +1 |
COMMIT | −1 |
ROLLBACK | Sets to 0 |
SAVE TRAN savepoint | No effect |
ROLLBACK TRAN savepoint | No effect |
Following is a summary of how transactional control relates to the values reported by:
When you log in to SQL Server, the value of @@trancount for your session is initially 0.
Each time you execute begin transaction, SQL Server increments @@trancount.
Each time you execute commit transaction, SQL Server decrements @@trancount.
Actual work is committed only when @@trancount reaches 0 again.
When you execute ROLLBACK TRANSACTION, the transaction is canceled and @@trancount returns to 0. Notice that ROLLBACK TRANSACTION cuts straight through any
number of nested transactions, canceling the overall main transaction.
This means that you need to be careful how you write code that contains a
ROLLBACK statement. You need to be sure to check for the
return status up through all levels and exit accordingly so you don’t
continue executing data modifications that were meant to be part of the
larger overall transaction.
Setting savepoints and rolling back to a savepoint do not affect @@trancount or transaction nesting in any way.
If a user connection is lost for any reason when @@trancount is greater than 0,
any pending work for that connection is automatically rolled back. SQL
Server requires that multistatement transactions be explicitly
committed.
Because the BEGIN TRAN statement increments @@trancount, each BEGIN TRAN statement must be paired with a COMMIT for the transaction to complete successfully.
Let’s look at some sample code to see the values of @@trancount as the transaction progresses. This first example is a simple explicit transaction with a nested BEGIN TRAN:
SQL Statement | @@trancount Value |
---|
SELECT "Starting....."
BEGIN TRAN
DELETE FROM table1
BEGIN TRAN
INSERT INTO table2
COMMIT
UPDATE table3
COMMIT
|
|
Transactions are nested syntactically only. The only commit tran statement that has an impact on real data is the last one, the statement that returns @@trancount to 0. That statement fully commits the work done by the initial and nested transactions. Until that final COMMIT TRAN is encountered, all the work can be rolled back with a ROLLBACK statement.
As a general rule of thumb, if a transaction is already active, you shouldn’t issue another BEGIN TRAN statement. You should check the value of @@trancount
to determine whether a transaction is already active. If you want to be
able to roll back the work performed within a nested transaction
without rolling back the entire transaction, you can set a savepoint
instead of issuing a BEGIN TRAN statement.
Implicit Transactions
AutoCommit transactions and explicit user-defined
transactions, which are the default transaction mode in SQL Server 2008,
are not ANSI-92 SQL compliant. The ANSI-92 SQL standard states that any
data retrieval or modification statement issued should implicitly begin
a multistatement transaction that remains in effect until an explicit ROLLBACK or COMMIT statement is issued. Microsoft refers to this transation mode as IMPLICIT_TRANSACTIONS.
To enable implicit transactions for a connection in SQL Server 2008, you need to enable the IMPLICIT_TRANSACTIONS session setting using the following command:
SET IMPLICIT_TRANSACTIONS ON
After this option is turned on, transactions are
implicitly started, if they are not already in progress, whenever any of
the following commands are executed:
ALTER TABLE
CREATE
DELETE
DROP
FETCH
GRANT
INSERT
OPEN
REVOKE
SELECT
TRUNCATE TABLE
UPDATE
Note that neither the ALTER VIEW nor ALTER PROCEDURE statement starts an implicit transaction.
You must explicitly complete implicit transactions by issuing a COMMIT or ROLLBACK;
a new transaction is started again on the execution of any of the
preceding commands. If you plan to use implicit transactions, the main
issue to be aware of is that locks are held until you explicitly commit
the transaction. This can cause problems with concurrency and the
system’s capability to truncate the transaction log.
Even when using implicit transactions, you can still issue the BEGIN TRAN statement and create transaction nesting. In the following example, IMPLICIT_TRANSACTIONS ON is turned on to see the effect this has on the value of @@trancount.
SQL Statements | @@trancount Value |
---|
SET IMPLICIT_TRANSACTIONS ON | 0 |
go | 0 |
INSERT INTO table1 | 1 |
UPDATE table2 | 1 |
COMMIT | 0 |
go | |
SELECT * FROM table1 | 1 |
BEGIN TRAN | 2 |
DELETE FROM table1 | 2 |
COMMIT | 1 |
go | |
DROP TABLE table1 | 1 |
COMMIT | 0 |
As you can see in this example, if a BEGIN TRAN is issued while a transaction is still active, transaction nesting occurs, and a second COMMIT is required to finish the transaction. The main difference between this example and the preceding one is that here, a BEGIN TRAN is not required to start the transaction. The first INSERT statement initiates the transaction. When you are running in implicit transaction mode, you don’t need to issue a BEGIN TRAN statement; in fact, you should avoid doing so to prevent transaction nesting and the need for multiple commits.
The following example shows the previous banking transaction using implicit transactions:
set implicit_transactions on
go
declare @checking_account char(10),
@savings_account char(10)
select @checking_account = '0003456321',
@savings_account = '0003456322'
update account
set balance = balance - $1000
where account_number = @checking_account
if @@error != 0
begin
rollback
return
end
update savings_account
set balance = balance + $1000
where account_number = @savings_account
if @@error != 0
begin
rollback
return
end
commit
This example is nearly identical to the explicit transaction example except for the lack of a BEGIN TRAN
statement. In addition, when in implicit transaction mode, you cannot
roll back to a named transaction because no name is assigned when the
transaction is invoked implicitly. You can, however, still set
savepoints and roll back to savepoints to partially roll back work
within an implicit transaction.
Tip
If you need to know within your SQL code whether implicit transactions are enabled so you can avoid issuing explicit BEGIN TRAN statements, you can check the @@options function. @@options
returns a bitmap that indicates which session-level options are enabled
for the current session. If bit 2 is on, implicit transactions are
enabled. The following code snippet can be used in stored procedures or
SQL batches to check this value and decide whether to issue a BEGIN TRAN statement:
if @@options & 2 != 2 — if bit 2 is not turned on
BEGIN TRAN —a begin tran can be issued since implicit transactions
are off
...
Implicit Transactions Versus Explicit Transactions
When would you want to use implicit transactions
versus explicit transactions? If you are porting an application from
another database environment, such as DB2 or Oracle, that uses implicit
transactions, that application converts over to SQL Server more easily
and with fewer code changes if you run in implicit transaction mode.
Also, if the application you are developing needs to be ANSI compliant
and run across multiple database platforms with minimal code changes,
you might want to use implicit transactions.
If you use implicit transactions in your applications, you need to be sure to issue COMMIT
statements as frequently as possible to prevent leaving transactions
open and holding locks for an extended period of time, which can have an
adverse impact on concurrency and overall system performance.
If an application is going to be hosted only
on SQL Server, it is recommended that you use AutoCommit and explicit
transactions so that changes are committed as quickly as possible and so
that only those logical units of work that are explicitly defined
contain multiple commands within a transaction.