ElectrumX - Reimplementation of Electrum-server
===============================================
::

  Licence: MIT Licence
  Author: Neil Booth
  Language: Python (>=3.5)


Motivation
==========

For privacy and other reasons, I have long wanted to run my own
Electrum server, but for reasons I cannot remember I struggled to set
it up or get it to work on my DragonFlyBSD system, and I lost interest
for over a year.

More recently I heard that Electrum server databases were around 35GB
in size when gzipped, and had sync times from Genesis of over a week
(and sufficiently painful that no one seems to have done one for a
long time) and got curious about improvements.  After taking a look at
the existing server code I decided to try a different approach.

I prefer Python3 over Python2, and the fact that Electrum is stuck on
Python2 has been frustrating for a while.  It's easier to change the
server to Python3 than the client.

It also seemed like a good way to learn about asyncio, which is a
wonderful and powerful feature of Python from 3.4 onwards.
Incidentally asyncio would also make a much better way to implement
the Electrum client.

Finally, though no fan of most altcoins, I wanted to write a codebase
that could easily be reused for those alts that are reasonably
compatible with Bitcoin.  Such an abstraction is also useful for
testnets, of course.


Implementation
==============

ElectrumX does not currently do any pruning.  With luck it may never
become necessary.  So how does it achieve a much more compact database
than Electrum server, which prunes a lot of history, and also sync
faster?

All of the following likely play a part:

- aggressive caching and batching of DB writes
- more compact representation of UTXOs, the address index, and
  history.  Electrum server stores full transaction hash and height
  for all UTXOs.  In its pruned history it does the same.  ElectrumX
  just stores the transaction number in the linear history of
  transactions.  For at least another 5 years the transaction number
  will fit in a 4-byte integer.  ElectrumX calculates the height from
  a simple lookup in a linear array which is stored on disk.
  ElectrumX also stores transaction hashes in a linear array on disk.
- storing static append-only metadata which is indexed by position on
  disk rather than in LevelDB.  It would be nice to do this for histories
  but I cannot think how they could be easily indexable on a filesystem.
- avoiding unnecessary or redundant computations
- more efficient memory usage
- asyncio and asynchronous prefetch of blocks.  ElectrumX should not
  have any need of threads.
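
The compact representation described above can be illustrated with a
minimal sketch.  This is not ElectrumX's actual code: the class and
function names are hypothetical, an in-memory buffer stands in for the
on-disk arrays, and the little-endian encoding is an assumption made
for the example.

```python
import io
import struct
from bisect import bisect_right

HASH_LEN = 32  # a Bitcoin transaction hash is 32 bytes


class TxIndex:
    """Hypothetical stand-in for the linear arrays kept on disk."""

    def __init__(self):
        # tx_counts[h] is the total number of transactions in blocks 0..h
        self.tx_counts = []
        # Flat append-only buffer of hashes; a real server would use a file
        self.hashes = io.BytesIO()

    def add_block(self, tx_hashes):
        """Append a block's hashes; return the tx number of its first tx."""
        first = self.tx_counts[-1] if self.tx_counts else 0
        self.hashes.seek(first * HASH_LEN)
        for tx_hash in tx_hashes:
            assert len(tx_hash) == HASH_LEN
            self.hashes.write(tx_hash)
        self.tx_counts.append(first + len(tx_hashes))
        return first

    def height(self, tx_num):
        """Height of the block containing tx_num, found by binary search."""
        return bisect_right(self.tx_counts, tx_num)

    def tx_hash(self, tx_num):
        """The hash sits at a fixed offset, so no key-value lookup is needed."""
        self.hashes.seek(tx_num * HASH_LEN)
        return self.hashes.read(HASH_LEN)


def pack_tx_num(tx_num):
    """Encode a tx number in 4 bytes (assumed little-endian unsigned)."""
    return struct.pack('<I', tx_num)


def unpack_tx_num(data):
    """Decode a 4-byte tx number produced by pack_tx_num."""
    return struct.unpack('<I', data)[0]
```

A UTXO or history entry then only needs the 4 bytes from
``pack_tx_num`` rather than a 32-byte hash plus a height; both can be
recovered on demand with ``tx_hash`` and ``height``.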


Roadmap
=======

- test a few more performance improvement ideas
- handle client connections (half-implemented but not functional)
- potentially move some functionality to C or C++

Once I get round to writing the server part, I will add DoS
protections if necessary to defend against requests for large
histories.  However with asyncio it would not surprise me if ElectrumX
could smoothly serve the whole history of the biggest Satoshi dice
address with minimal negative impact on other connections; we shall
have to see.  If the requestor is running Electrum client I am
confident that it would collapse under the load far more quickly than
the server would; it is very inefficient at handling large wallets
and histories.


Database Format
===============

The database and metadata formats of ElectrumX are very likely to
change in the future which will render old DBs unusable.  For now I do
not intend to provide converters as the rate of flux is high.


Miscellany
==========

As I've been researching where the time is going during block chain
indexing and how various cache sizes and hardware choices affect it,
I'd appreciate it if anyone trying to synchronize could tell me::

  - their O/S and filesystem
  - their hardware (CPU name and speed, RAM, and disk kind)
  - whether their daemon was on the same host or not
  - whatever stats about sync height vs time they can provide (the
    logs give it all in wall time)
  - the network (e.g. bitcoin mainnet) they synced


Neil Booth
kyuupichan@gmail.com
https://github.com/kyuupichan
1BWwXJH3q6PRsizBkSGm2Uw4Sz1urZ5sCj