Understanding the Problem

One of the most challenging issues with Apache Derby is database corruption, which can manifest as:

  • Unexpected SQLNonTransientConnectionException errors.
  • Data inconsistency when running queries.
  • Issues when starting the database, such as Database not found errors.

These issues often stem from improper shutdown procedures, disk failures, or concurrency conflicts in multi-threaded applications.

Root Causes and Diagnosis

Improper Shutdown

Apache Derby requires a clean shutdown using:

DriverManager.getConnection("jdbc:derby:;shutdown=true");

If this step is skipped, especially in embedded mode, the database can remain in an inconsistent state.

Uncommitted Transactions

Derby uses write-ahead logging, but uncommitted transactions during a crash can cause corruption. Check transaction logs for uncommitted changes:

SELECT XID, STATUS FROM SYSCS_DIAG.TRANSACTION_TABLE;

Multi-Threading Issues

Apache Derby is not thread-safe by default. If accessed concurrently without proper synchronization, data corruption may occur. Ensure safe multi-threaded access by using a connection pool like Apache Commons DBCP.

Fixing and Preventing Database Corruption

Running Database Recovery

If corruption is detected, restart Derby in recovery mode:

java -Dderby.system.home=/path/to/db org.apache.derby.tools.ij

Then run:

CALL SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE();

Enabling Auto-Recovery

Derby supports auto-recovery by setting:

derby.storage.recoverOnBoot=true

This ensures recovery on startup if an unclean shutdown occurred.

Using Connection Pools

To prevent concurrency issues, use a connection pool instead of direct JDBC connections:

BasicDataSource ds = new BasicDataSource();
ds.setUrl("jdbc:derby:myDB;create=true");
ds.setMinIdle(5);
ds.setMaxIdle(10);
ds.setMaxOpenPreparedStatements(100);

Regular Backups

Schedule regular database backups using:

CALL SYSCS_UTIL.SYSCS_BACKUP_DATABASE('/backup/location');

Conclusion

Database corruption in Apache Derby is often due to improper shutdowns, uncommitted transactions, and concurrency issues. Following best practices such as clean shutdowns, auto-recovery, connection pooling, and regular backups helps maintain database integrity.

FAQs

1. How do I know if my Apache Derby database is corrupted?

Check for startup errors, missing tables, or failed queries. Run SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE() to detect corruption.

2. Can I recover a corrupted Apache Derby database?

Yes. Restart Derby with recovery enabled and run a full backup restore if needed.

3. Why does my database fail to start after a crash?

Uncommitted transactions or an improper shutdown can leave Derby in an inconsistent state. Ensure that derby.storage.recoverOnBoot=true is set.

4. How can I prevent Derby from becoming unresponsive in multi-threaded applications?

Use a connection pool instead of managing direct connections, and avoid sharing a single connection across multiple threads.

5. What is the best way to back up Apache Derby?

Use the built-in system procedure SYSCS_UTIL.SYSCS_BACKUP_DATABASE() to create consistent backups.