A Crystallization Robot for Generating True Random Numbers Based on Stochastic Chemical Processes, Hacker News
2k Views
Highlights
) •
Automated system for obtaining statistical data on crystal formation
First example of generation of true random numbers using stochasticity of chemistry
• . Encryption capability better than Mersenne Twister pseudorandom number generator •
Analysis of entropy in different crystallizing compounds.
Progress and Potential . As chemistry and materials synthesis is starting to embrace an era of automation and the use of machine learning, it is becoming vital th at the quality and reliability of that data is assessed. By automating and parallelizing batch chemical reactions, enough samples may be run that statistical data can be obtained on the reaction system. We monitored the crystallization for hundreds of parallel reactions using a webcam and found that crystal features in the images obtained could be used to generate true random numbers. We also found that the approximate entropy of these numbers was different for different types of chemical reaction, and that the encryption capability of these numbers was greater than a commonly used pseudorandom number generator. This is the first time that stochasticity of chemistry has been investigated in large datasets from experimental data.
Summary (chemistry inherently involves a wide range of stochastic processes, yet chemists do not typically explore stochastic processes at the macroscale due to the difficulty in gathering data. We wondered whether it was possible to explore such processes, in this case crystallization, in a systematic way using an autonomous robotic platform. By performing inorganic reactions in an automated system, and observing the resultant occupied macrostate (crystallization images), we developed a powerful entropy source for generation of true random numbers. Randomness was confirmed using tests described by the National Institute for Standards and Technology (p uniformity>> 0. 18. Deficit from maximum approximate entropy was found be different between compounds (p
ANOVA (
Graphical Abstract )
Material Advancement Progression (MAP2: Benchmark)
Organic synthesis in a modular robotic system driven by a chemical programming language.
, 4
Coley CW
Thomas DA
Lummiss JAM
Jaworski JN
Breen CP
Schultz V.
Hart T.
Fishman JS
Rogers L.
Gao H.
et al.
A robotic platform for flow synthesis of organic compounds informed by AI planning.
(and and
(5)
Granda JM
Donina L.
Dragone V.
Long D.-L.
Cronin L.
Controlling an organic synthesis robot with machine learning to search for new reactivity.
, 6 (6)
Zhavoronkov A. Ivanenkov YA
Aliper A.
Veselov MS
Aladinskiy VA
Aladinskaya AV
Terentiev VA
Polykovskiy DA
Kuznetsov MD
Asadulaev A.
et al.
Deep learning enables rapid identification of potent DDR1 kinase inhibitors.
but also for exploration of data reliability. Another important aspect of exploring the extent to which experimental data are reproducible is the fact that many processes are intrinsically stochastic. Such stochasticity can also be useful, such as in the generation of random numbers.
Random numbers are used extensively in many applications, such as cryptography
(7) ()
(Baur S.)
Deppe C.
(Boche H.
Secure storage for identification; random resources and privacy leakage.
A stochastic SIS model driven by random diffusion of air pollutants.
where their non-deterministic properties and unpredictability are essential. The source of randomness (ie, the entropy source) is a crucial factor for whether correlations between generated numbers are present, and this has a great impact on the randomness and utility of the output.
(9)
Stipčević M.
Koç Ç.K.
True random number generators.
As such, random number generation not only can help us understand the reproducibility of a process, but also are more desirable than computational methods of number generation (eg, pseudorandom number generators) . This is because they extract their randomness from a physical system with a large available pool of entropy.
Noll, LC; Mende, R.G .; Sisodiya, S. .. (2014. Method for seeding a pseudo-random number generator with a cryptographic hash of a digitization of a chaotic system.US Patent US A, filed January , and granted March , .
Importantly, the generation of random outcomes is of profound importance as a source of noise, and potentially allows the generation of unanticipated data.
In a chemical system, each time a reaction is performed there is an almost infinite number of energetically equivalent ways for particular reagents to combine, resulting in both high uncertainty and entropy, and the exact pathway undertaken will n ever be repeated. The overall outcome of such a reaction is therefore an example of one specific state out of an almost infinite number of possible macrostates.
Thermodynamics and Statistical Mechanics (Classical Theoretical Physics).
As such, the entropy of such a chemical system is extraordinarily high,
Krivovichev SV
Which inorganic structures are the most complex?.
and may therefore serve as a very good entropy pool for application of random number generation.
In this regard, we hypothesized that one such system with a large entropy pool is that of compound formation and subsequent crystallization, where the detectable ensemble macrostates considered are the locations and morphologies of each crystal that has grown in a period of time as a result of these processes. To explore this idea, we set out to develop a fully automated system to not only do the chemical reactions but also grow crystals of the products using a camera as a detector. The platform was designed so that it converts these data into binary sequences, as shown schematically in (Figure 1) , which are assessed for randomness using the methods specified in National Institute for Standards and Technology ( NIST (special publication) – (a.
Evolution of oil droplets in a chemorobotic platform.
an additional set of motorized linear axes were attached to the underside of the device to support a camera on a mobile gantry. The mobility in the main CNC framework and auxiliary framework were controlled using technology originally designed for the open source “RepRap” 3D printer.
(Jones R.) Haufe P.
(Sells E.) Iravani P.
Olliver V.
Palmer C.
Bowyer A.
RepRap — the replicating rapid prototyper.
Reagent stock solutions were located adjacent. to the platform, and could be transferred to vials in the crystallization array using a combination of tubing and pumps.
Additional 3D-printed components were incorporated to direct the reagent outlets, fix the positions of the vials in a (x) (array, and support the camera) (Figures S1 – S8) ). Experiments consisted of pre-set volumes of stock solutions being pumped into each 024 – mL vial in a vial array, sequentially. The subsequent growth of crystals in each of the vials was recorded by a mobile camera at regular 24 – min intervals at a resolution of 1, (×) pixels. Image analysis using an object-detection and image-segmentation algorithm (Mask R-CNN)
Abdulla W.
Mask R-CNN for object detection and instance segmentati on on Keras and TensorFlow. GitHub repository.
Human versus robot in the discovery and crystallization of gigantic polyoxometalates.
The location, size, shape, orientation, and color of crystal formation within the vial were detected and taken as the entropy source for this system (see
Integrated 3D-printed reactionware for chemical synthesis and analysis.
hereafter referred to as {W
() ; and the synthesis and crystallization of the coordination cluster [Co4(2-pyridinemethanol)4(MeOH)4Cl4],
Yang E.-C.
Hendrickson DN
Wernsdorfer W.
Nakano M.
Zakharov LN
Sommer RD
Rheingold AL
Ledezma-Gairaud M.
Christou G.
Cobalt single-molecule magnet.
hereafter referred to as {Co 4 . Images of these crystallizations at different times are shown in (Figure 3) B and their chemical structures are shown in Figures S9 – S
. These reactions involve the chemical processes of (1), (1 and 2), and (1, 2, and 3), respectively. The synthetic procedure files are included in the online repository and were used to perform the experiments in a fully automated manner. Compound confirmation was obtained by performing single-crystal X-ray diffraction.
[W19Mn2O61Cl(SeO3)2(H2O)2] Results of the one-way ANOVA show that the samples are statistically different, w Hill Dunn’s test shows that all compounds are different from each other with order of increasing DEF
3 (being {W
},> CuSO
(4) , {Co
(4) }.
View Large Image
(Figure Viewer)
Download (PPT)
Finally, since random numbers are commonly used to encrypt data, we considered the encryption capability of this random number generator versus that of a frequently used pseudorandom number generator, the Mersenne Twister (MT).
78 Matsumoto M.
Nishimura T.
Mersenne twister: a
– dimensionally equidistributed uniform pseudo-random number generator.
Since the MT output is determined based on its internal state, knowledge of this state allows accurate prediction of future output . However, this is not possible in the case of the true random number generator, whose internal state is either (seemingly) non-existent
We gratefully acknowledge financial support from the EPSRC (grant nos. EP / P) (X / 1, EP / J 30024 / 1, EP / K / 1, EP / K / 1, EP / L 121759 / 1, and EP / L (/ 1) and ERC (project SMART -POM).
(Author Contributions) The idea was conceived by LC and developed by E.C.L. with help from J.M.P.-G., A.H., and E.K.B. E.C.L. built the robot, wrote the code, did the reactions, and analyzed the data with help from J, M.P.-G. and A.H. All the authors helped to write the manuscript. Declaration of Interests The authors declare no competing interests.
Data and Code Availability All experimental data are available from the authors upon request. Code for the binarization of images, sequence randomness testing, and figure preparation is located at (https://github.com/croningp/xtl-rng) .
Supplemental Information )
References Jia X.
Lynch A.
Huang Y.
Danielson M.
Lang’at I.
Milder A.
Ruby AE
Wang H.
Friedler SA
Norquist AJ
Schrier J.
Anthropogenic biases in chemical reaction data hinder exploratory inorganic synthesis. (Nature.) ; : –
(Dodge J.) Gururangan S.
(Card D.) Schwartz R.
Smith NA
Show your work : Improved reporting of experiment al results.
Organic synthesis in a modular robotic system driven by a chemical programming language. (Science.) ; : eaav
Coley CW
Thomas DA
Lummiss JAM
Jaworski JN
Breen CP
Schultz V.
Hart T.
Fishman JS
Rogers L.
Gao H.
et al.
A robotic platform for flow synthesis of organic compounds informed by AI planning. (Science.) 4001; : eaax
Granda JM
Donina L.
Dragone V.
Long D.-L.
Cronin L.
Controlling an organic synthesis robot with machine learning to search for new reactivity. Nature. ; : . –
Zhavoronkov A.
) Ivanenkov Y.A.
Aliper A.
Veselov MS
Aladinskiy VA
Aladinskaya AV
Terentiev VA
Polykovskiy DA
Kuznetsov MD
Asadulaev A.
et al.
Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat. Biotechnol. ; :
(-)
(Baur S.) (Deppe C.
(Boche H.
) Secure storage for identification; random resources and privacy leakage.
IEEE Trans. Inf. Forensics Security. 2393;
: . –
He S.
Tang S.
(Wang W.
A stochastic SIS model driven by random diffusion of air pollutants.
(Physica A.)
;
:
.
Stipčević M.
Koç Ç.K.
True random number generators. in: Koç Ç.K. Open Problems in Mathematics and Computational Science. Springer International Publishing ,
: 346 – (Li P.) Wang Y.-C.
Zhang J.-Z.
All-optical fast random number generator.
Opt. Express. ; : – 670467 Noll, LC; Mende, R.G .; Sisodiya, S. .. (2014. Method for seeding a pseudo-random number generator with a cryptographic hash of a digitization of a chaotic system.US Patent US A, filed January , and granted March , .
Feistel R.
Distinguishing between Clausius, Boltzmann and Pauling entropies of frozen non-equilibrium states.
(Entropy.) 2393; :
Greiner W.
Neise L.
(Stocker H.)
Thermodynamics and Statistical Mechanics (Classical Theoreti cal Physics).
Second Edition. Springer ,
Krivovichev SV
which inorganic structures are the most complex?.
Angew. Chem. Int. Ed. ; : 823 –
(Rukhin A.) (Soto. J.
Nechvatal J.
Smid M.
Barker E.
(Leigh S.)
Levenson M.
(Vangel M.) Banks D.
Heckert A.
et al.
A statistical test suite for the validation of random number generators and pseudo random number generators for cryptographic applications. ()
This forum is intended for constructive dialog. Comments that are commercial or promotional in nature, pertain to specific medical cases, are not relevant to the article for which they have been submitted, or are otherwise inappropriate will not be posted.
We recommend that commenters identify themselves with full names and affiliations.
GIPHY App Key not set. Please check settings