Molecular Connections (MC) deployed graduates and postgraduates in chemistry, from both applied and physical science backgrounds, to frame rules for selecting the keywords that represent the concepts and chemical substances relating to the novelty reported in scientific journals. Doctorates and subject matter experts in various areas of chemistry interpreted and correlated the reaction mechanisms and validated the most appropriate reaction analysis from research articles.
To process scientific journals in Asian languages such as Chinese and Japanese, as well as in Russian, MC developed a hybrid multilingual indexing solution that combines technology-enabled automation with statistical methods and Natural Language Processing (NLP) rules. In addition, the MC team developed a knowledge repository of unique concepts that recur frequently in scientific journals. Built up through client feedback and queries, the repository also contains ready solutions for the different variants of reaction procedures that authors describe (polymers, peptides, synthesis, etc.).
MC analysed the unified process and split it into two stages: indexing and reaction analysis. The indexing output became the input for reaction analysis, a handoff that is integrated in the workflow platform.
Technology Role in Excerption and Curation
MC automated the process by deploying a robust technology platform with three modules: auto-indexing of substances, writing up of reactions for indexed substances, and structures. The platform auto-indexes key concepts and substances based on the thesaurus MC developed. It presents the auto-indexed keywords under the headings Title, Abstract, Tables and Figures so that they are easy to review and enhance. The platform also offers an option to add new keywords to the thesaurus so that it keeps learning. The indexing output serves as the input for reaction analysis, which is integrated in the same workflow platform. Frequently used Markush structures and reaction participants such as solvents, catalysts and reagents are built into the platform as drop-down options.
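The thesaurus-based auto-indexing described above can be illustrated with a minimal sketch. It is not MC's implementation: the thesaurus entries, the function names (`add_term`, `index_text`, `index_document`) and the simple case-insensitive phrase matching are all assumptions made for illustration. It shows keywords being grouped under the Title, Abstract, Tables and Figures headings, and a new keyword being added to the thesaurus.

```python
import re

# Hypothetical thesaurus: surface form (lowercase) -> preferred index term.
# These entries are illustrative only, not taken from MC's thesaurus.
THESAURUS = {
    "thf": "tetrahydrofuran",
    "tetrahydrofuran": "tetrahydrofuran",
    "suzuki coupling": "Suzuki coupling",
    "palladium acetate": "palladium(II) acetate",
}

# Section headings under which the indexed keywords are grouped.
SECTIONS = ("Title", "Abstract", "Tables", "Figures")


def add_term(thesaurus, surface_form, preferred_term):
    """Register a new keyword so later indexing runs can find it."""
    thesaurus[surface_form.lower()] = preferred_term


def index_text(text, thesaurus):
    """Return preferred terms found in text, ordered by first appearance."""
    first_hit = {}
    for surface, preferred in thesaurus.items():
        # Match the surface form as a whole word or phrase, ignoring case.
        pattern = r"(?<!\w)" + re.escape(surface) + r"(?!\w)"
        match = re.search(pattern, text, flags=re.IGNORECASE)
        if match:
            pos = match.start()
            if preferred not in first_hit or pos < first_hit[preferred]:
                first_hit[preferred] = pos
    return sorted(first_hit, key=first_hit.get)


def index_document(doc, thesaurus):
    """Index each section of a document separately."""
    return {s: index_text(doc.get(s, ""), thesaurus) for s in SECTIONS}


if __name__ == "__main__":
    thesaurus = dict(THESAURUS)
    doc = {
        "Title": "Suzuki coupling in THF",
        "Abstract": "Palladium acetate was used in tetrahydrofuran (THF).",
        "Tables": "",
        "Figures": "Yield vs. ligand loading",
    }
    print(index_document(doc, thesaurus))
    # An indexer adds a keyword the thesaurus did not yet know.
    add_term(thesaurus, "ligand loading", "ligand loading")
    print(index_document(doc, thesaurus)["Figures"])
```

A production system would need more than exact phrase matching, for example handling of synonyms, inflections and chemical name variants, which is where the statistical methods and NLP rules mentioned earlier would come in.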
To handle the high volume, the MC workflow platform was hosted both in-house and in the cloud, allowing seamless, continuous processing without interrupting production. This helped MC meet deadlines irrespective of volume.