<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article" xml:lang="en">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">AJMB</journal-id>
			<journal-title>Avicenna Journal of Medical Biotechnology</journal-title>
			<issn pub-type="ppub">2008-2835</issn>
			<issn pub-type="epub">2008-4625</issn>
			<publisher>
				<publisher-name>Avicenna Research Institute</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">AJMB-4-121</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Original Article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Thioredoxin System: A Model for Determining Novel Lead Molecules for Breast Cancer Chemotherapy</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author" corresp="yes">
					<name>
						<surname>Jamil</surname>
						<given-names>Kaiser</given-names>
					</name>
					<xref ref-type="aff" rid="AF0001">1</xref>
					<xref ref-type="corresp" rid="cor1">&#x002A;</xref>
				</contrib>
				<contrib contrib-type="author">
					<name>
						<surname>Mustafa</surname>
						<given-names>Sabeena Muhammed</given-names>
					</name>
					<xref ref-type="aff" rid="AF0002">2</xref>
				</contrib>
			</contrib-group>
			<aff id="AF0001">
				<label>1</label>School of Life Sciences, Centre for Biotechnology and Bioinformatics (CBB), Jawaharlal Nehru Institute of Advanced Studies, Secunderabad, India</aff>
			<aff id="AF0002">
				<label>2</label>Centre for Biotechnology and Bioinformatics (CBB), Jawaharlal Nehru Institute of Advanced Studies, Secunderabad, India</aff>
			<author-notes>
				<corresp id="cor1">
				<label>&#x002A;</label>
				<bold>Corresponding author:</bold> Kaiser Jamil, Ph.D., School of Life Sciences, Centre for Biotechnology and Bioinformatics (CBB), Jawaharlal Nehru Institute of Advanced Studies, Secunderabad, India. <bold>Tel:</bold> +91 40 27541551. <bold>Fax:</bold> +91 40 27541552. <bold>E-mails:</bold> <email xlink:href="kj.bmmrc@gmail.com">kj.bmmrc@gmail.com</email>; <email xlink:href="kaiser.jamil@jnias.in">kaiser.jamil@jnias.in</email>
				</corresp>
			</author-notes>
			<pub-date pub-type="ppub">
				<season>July-September</season>
				<year>2012</year>
			</pub-date>
			<volume>4</volume>
			<issue>3</issue>
			<fpage>121</fpage>
			<lpage>130</lpage>
			<history>
				<date date-type="received">
					<day>29</day>
					<month>12</month>
					<year>2011</year>
				</date>
				<date date-type="accepted">
					<day>02</day>
					<month>05</month>
					<year>2012</year>
				</date>
			</history>
			<permissions>
				<copyright-statement>Copyright &#x00A9; 2012 Avicenna Research Institute</copyright-statement>
				<copyright-year>2012</copyright-year>
				<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by-nc/3.0/">
					<p>This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License which allows users to read, copy, distribute and make derivative works for non-commercial purposes from the material, as long as the author of the original work is cited properly.</p>
				</license>
			</permissions>
			<abstract>
				<sec id="st1">
				<title>Background</title>
					<p>Thioredoxin reductase 1 (TXNRD1) and thioredoxin interacting protein (TXNIP) also known as thioredoxin binding protein 2 or vitamin D3-upregulated protein 1 are key players in oxidative stress control. Thioredoxin (TRX) is one of the major components of the thiol reducing system and plays multiple roles in cellular processes. Computational analyses of TXNRD1, TXNIP and TRX expressions have not been analyzed in relation to prognosis of breast cancer. High expression of TXNRD1 and low expression of TXNIP are associated with worst prognosis in breast cancer.</p>
				</sec>
				<sec id="st2">
					<title>Methods</title>
					<p>Using bioinformatics applications we studied sequence analysis, molecular modeling, template and fold recognition, docking and scoring of thioredoxin as a target.</p>
				</sec>
				<sec id="st3">
					<title>Results</title>
					<p>The resultant model obtained was validated based on the templates from I-TASSER server and binding site residues were predicted. The predicted model was used for Threading and Fold recognition and was optimized using GROMACS. The generated model was validated using programs such as Procheck, Ramachandran plot, verify-3d and Errat value from Saves server, and the results show that the model is reliable. Next we obtained small molecules from pubchem and chembank which are databases for selecting suitable ligands for our modeled target. These molecules were screened for docking, using GOLD and scoring was obtained using Chemscore as a scoring function.</p>
					</sec>
				<sec id="st4">
					<title>Conclusion</title>
					<p>This study predicted the ligand interaction of four molecules with the minimized protein modeled structure and the best ligand with top scores from about 500 molecules screened. These were 3-hydroxy-2,3-diphenylbutanoic acid, 4-amino-3-pentadecylphenol, 3-(hydroxyimino)-2,4-diphenylbutanenitrile and 2-ethyl-1,2-diphenylbutyl carbamate, which are proposed as possible hit molecules for the drug discovery and development process.</p>
				</sec>
			</abstract>
			<kwd-group>
				<kwd>Breast cancer</kwd>
				<kwd>Chemotherapy</kwd>
				<kwd>Sequence analysis</kwd>
				<kwd>Thioredoxins</kwd>
			</kwd-group>
		</article-meta>
	</front>
	<body>
		<sec id="S0001" sec-type="intro">
			<title>Introduction</title>
			<p>Recent research has shown the importance of reduction/oxidation (redox) regulation in various biological phenomena. Thioredoxin (TRX) is one of the major components of the thiol reducing system and plays multiple roles in cellular processes such as proliferation, apoptosis, and gene expression (<xref ref-type="bibr" rid="CIT0001">1</xref>). Reactive Oxygen Species (ROS) and the cellular thiol redox state are crucial mediators of multiple cell processes like growth, differentiation and apoptosis (<xref ref-type="bibr" rid="CIT0002">2</xref>).</p>
			<p>Increased levels of thioredoxin occur in a number of human cancers, which may contribute to the resistance of cancers to therapy by scavenging ROS that are generated by various anti-cancer agents (<xref ref-type="bibr" rid="CIT0003">3</xref>). Breast cancer is a kind of malignant tumor that occurs when cells in the breast becomes so over-active that they won&#x0027;t stop multiplying (<xref ref-type="bibr" rid="CIT0004">4</xref>). Using bioinformatics approaches we have analyzed the sequences of the TXNIP in order to develop a thioredoxin model which can be used as suitable target for determining novel lead molecules for breast cancer.</p>
			<p>Members of the TRX system regulate apoptosis through a wide variety of mechanisms. A family of thioredoxin-dependent peroxidases (peroxiredoxins) protects against apoptosis by scavenging hydrogen peroxide. Thioredoxin-1 (Trx-1) is a small redox protein that is over-expressed in many human tumors, where it is associated with aggressive tumor growth and decreased patient survival. Trx-1 is secreted by tumor cells and is present at increased levels in the plasma of cancer patients. It is reported that Thioredoxin 1 (Trx-1) and Thioredoxin 2 (Trx-2) have opposed regulatory functions on hypoxia-inducible factor-1&#x03B1;<sup>&#x002A;</sup> Thioredoxin 2 is a critical regulator of cytochrome <italic>c</italic> release and mitochondrial apoptosis; transmembrane thioredoxin-related molecule (TMX) has a protective role in Endoplasmic Reticulum (ER) stress-induced apoptosis (<xref ref-type="bibr" rid="CIT0005">5</xref>).</p>
			<p>Thioredoxin is known to have important roles in the cellular responses and several studies implicate thioredoxin as a contributor to cancer progression. In cancers the tumor environment is usually under either oxidative or hypoxic stress and both stresses are known up-regulators of thioredoxin expression (<xref ref-type="bibr" rid="CIT0006">6</xref>).</p>
			<p>The Trx system is a ubiquitous thiol-reducing system that includes Trx, Trx-interacting protein (Txnip), Trx reductase (Trxr) and NADPH. Trx is a small (12 <italic>kDa</italic>) protein with a conserved active site Trp-Cys-Gly-Pro-Cys that plays an important defensive role against oxidative stress by scavenging intra-cellular ROS. Binding of ROS leads to Trx oxidation. Trxr in the presence of NADPH can convert oxidized Trx back to its reduced form. Trx proteins are represented in the cell by at least two forms; Trx1 which is present in the cytoplasm and Trx2 which is localized in the mitochondria (<xref ref-type="bibr" rid="CIT0007">7</xref>).</p>
			<p>The major aim of our study was to analyze the Thioredoxin system as it is an important target in drug discovery studies for some diseases, but our aim was to use this system to identify cancer drugs from the drug-database; therefore computational methods were used to identify the possible inhibitors to thioredoxin.</p>
		</sec>
		<sec id="S0002" sec-type="materials|methods">
			<title>Materials and Methods</title>
			<sec id="S20003">
				<title>Thioredoxin sequence analysis</title>
				<p>Thioredoxin sequence forms the basis of our study as this is a good target for cancer chemotherapy. We selected the gene sequences from UniProt KB, which is commonly used as knowledge base for molecular sequences. Most of the sequences in UniProt KB are derived from the conceptual translation of nucleotide sequences. It plays an important role by providing a stable, comprehensive, freely accessible central resource on protein sequences and functional annotation. We used computational analysis for the functional annotation for the gene sequences.</p>
			</sec>
			<sec id="S20004">
				<title>BLAST program with PSI-BLAST specification with PDB</title>
				<p>Position Specific Iterative BLAST (PSI-BLAST) profile was generated from local alignments of the most highly scoring hits in the initial BLAST results by calculating position-specific scores for every position in the alignment in the sequences. Five template sequences were generated based on this alignment program. This iterative procedure increased the sensitivity of the BLAST search and helped us to identify new relationships between the query and database entries. Clustal W was used for Multiple Sequence Alignment Program (MSA) to determine the conserved sequences among the templates.</p>
			</sec>
			<sec id="S20005">
				<title>3-D model building</title>
				<p>As the PSI- BLAST similarity obtained was less than 30%, we preferred the fold recognition method as the option to build 3-D model. Usually fold recognition methods are so efficient especially in the following cases: first, when the sequence has little or no primary sequence similarity to any sequence with a known structure. Second, when some model from the structure library represents the true fold of the sequence.</p>
				<p>Our study falls into the first category in which we tried to recognize the structural fold of the target protein from a structure template library, given its sequence information and then generate an alignment between the query and the recognized template protein, from which the structure of query protein was predicted. We used I-TASSER web server (<xref ref-type="bibr" rid="CIT0008">8</xref>) which has generated five predicted 3D models for the requests. A scoring function (C-score) based on the relative clustering structural density and the consensus significance score of multiple threading templates were obtained to estimate the accuracy of the I-TASSER predictions.</p>
			</sec>
			<sec id="S20006">
				<title>Scoring and validation of 3-D model</title>
				<p>The output of the I-TASSER server for our query protein included the prediction of secondary structure, top five full-length models with confidence scores, the estimated TM-score, RMSD, standard deviation of the estimations and top ten templates. The binding site predicted by I-TASSER server suggests 26 amino acids residues as the possible binding site residues.</p>
			</sec>
			<sec id="S20007">
				<title>Threading/fold recognition</title>
				<p>Modeller 8 was used to construct the model based on the generated templates for the following PDB ids 1G4MA, 2WTRB, 2R51A, 1CF1C, 2FAUA. All templates were taken based on their folds to construct the model.</p>
				<p>Functional characterization of a protein sequence is a common goal in biology, and is usually facilitated by having an accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation (<xref ref-type="bibr" rid="CIT0009">9</xref>).</p>
				<p>The following threading programs were used to collect the templates:<list list-type="order">
						<list-item>
							<p>MUSTER</p>
						</list-item>
						<list-item>
							<p>HHSEARCH</p>
						</list-item>
						<list-item>
							<p>SP3</p>
						</list-item>
						<list-item>
							<p>PROSPECT2</p>
						</list-item>
						<list-item>
							<p>PPA-I</p>
						</list-item>
						<list-item>
							<p>HHSEARCH I</p>
						</list-item>
						<list-item>
							<p>FUGUE</p>
						</list-item>
						<list-item>
							<p>SPARKS</p>
						</list-item>
					</list>
				</p>
				<p>This model was optimized using GROMACS (<xref ref-type="bibr" rid="CIT0010">10</xref>) which is a molecular dynamics package primarily designed for biomolecular systems such as proteins and lipids. The minimized model obtained was used for virtual screening in order to filter the compounds using GOLD score.</p>
			</sec>
			<sec id="S20008">
				<title>Molecular screening</title>
				<p>The use of virtual screening to discover new inhibitors is becoming a common practice in modern drug discovery (<xref ref-type="bibr" rid="CIT0011">11</xref>).</p>
				<p>Receptor-based virtual screens seek to &#x201C;dock&#x201D; members of a chemical library against a given protein structure, predicting the conformation and binding affinity of the small molecules (<xref ref-type="bibr" rid="CIT0012">12</xref>). All possible compounds which obey rule of 5 were collected from PUBCHEM, which is the main resource for obtaining freely-available bioassay data provided by the National Center for Biotechnology Information [NCBI] (<xref ref-type="bibr" rid="CIT0013">13</xref>).</p>
				<p>ZINC is a free database for virtual screening that contains over 4.6 million compounds in ready-to-dock, 3D formats, available at the URL <ext-link ext-link-type="uri" xlink:href="http://zinc.docking.org">http://zinc.docking.org</ext-link>. Molecules in ZINC are annotated by molecular property that include molecular weight, number of rotatable bonds, calculated LogP, number of hydrogen-bond donors, hydrogen-bond acceptors, chiral centers, chiral double bonds (E/Z isomerism), polar and a polar desolvation energy (in <italic>kcal/mol</italic>), net charge and rigid fragments. The database contains 494,915 Lipinski compliant molecules and 202,134 &#x2018;lead-like&#x2019; molecules, having molecular weight in the range 150 to 350 with calculated LogP &#x003C;4, number of hydrogen-bond donors &#x2264;3 three, and number of hydrogen-bond acceptors &#x2264;6. ZINC provides several search criteria such as molecular property constraint, ZINC codes, vendor based, and molecular substructure search (<xref ref-type="bibr" rid="CIT0014">14</xref>).</p>
			</sec>
			<sec id="S20009">
				<title>Docking and scoring</title>
				<p>The sorted ligands again were docked using GOLD and their interactions were studied. The protein was loaded into GOLD wizard, tailed by addition of hydrogens, then selection of binding site residues, choosing score (GOLD), and then the program. It is the first algorithm to be evaluated on a large dataset of complexes (<xref ref-type="bibr" rid="CIT0015">15</xref>). The result gives an empirical free energy scoring function that estimates the free energy of binding of molecules.</p>
				<p>The Chemscore function was implemented as a scoring function for the protein&#x2013;ligand docking program GOLD, and its performance compared to the original Goldscore function in terms of docking accuracy, prediction of binding affinities, and speed. In terms of producing binding energy estimates, the Gold-score function appears to perform better than the Chemscore function and the two consensus protocols, particularly for faster search settings. Even at docking speeds of around 1&#x2013;2 <italic>min</italic>/compound, the Goldscore function predicts binding energies with a standard deviation of &#x223C;10.5 <italic>kJ/mol</italic> 
 (<xref ref-type="bibr" rid="CIT0016">16</xref>). Based on the interaction studies and the Lipinski&#x0027;s rule of five, four molecules were selected as the hit molecules for further analysis in the wetlab. ChemDraw is the software used to sketch and build the molecular models.</p>
			</sec>
		</sec>
		<sec id="S0010" sec-type="results">
			<title>Results</title>
			<sec id="S20011">
				<title>Query sequence search using PSI-BLAST</title>
				<p>Utilizing PSI-BLAST in the National Center for Biotechnology Information (NCBI), we searched the sequences based on the query (UniprotKB ID: Q9H3M7) with which we generated the five templates for Thioredoxin which are shown in (<xref ref-type="fig" rid="F0001">Figure 1</xref>). PSI-BLAST program increased the sensitivity of the BLAST search and helped us to generate the possible alignment between the thioredoxin query sequences and database sequences. It also constructed a position-specific substitution matrix for the query sequence during the search. The obvious advantage of this approach was that we could obtain more accurate statistical estimates, along with the application of BLAST/PSI-BLAST in terms of speed of aligning the sequences. This gave us five corresponding PDB hits and alignments. Furthermore, by using the Multiple Sequence Alignment Program we determined the conserved positions which are shown in <xref ref-type="fig" rid="F0001">figures 1A</xref> and <xref ref-type="fig" rid="F0001">1B</xref>.</p>
				<fig id="F0001">
					<label>Figure 1</label>
					<caption>
						<p>Alignment of the templates (PDB hit) used to generate 3D model:</p>
						<p>a. Alignment shown in Clustal W Multiple Sequence Alignment (MSA) program</p>
						<p>b. Alignment shown in Clustal W Multiple Sequence Alignment (MSA) score table</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g001.tif" alt-version="no"/>
				</fig>
			</sec>
			<sec id="S20012">
				<title>Construction of 3-D models</title>
				<p>We constructed 3-D models of these templates utilizing the I-TASSER web server. The target sequences were first threaded through a representative PDB structure library to select the best possible models from PDB, based on the templates. Using I-TASSER we obtained the scoring values as C-score, TM-score (0.08) as well as RMSD (2 <italic>&#x00C5;</italic>) values. Based on these models we could find the binding site residues. By aligning the templates and refining the iterative threading assembly (I-TASSER), Web server was used to integrate the platform for automated protein structure and function prediction. And we could generate five complete models with Combining confidence score (C-score) and protein length.</p>
			</sec>
			<sec id="S20013">
				<title>The algorithm of I-TASSER</title>
				<p>The advantage of using I-TASSER is that it first generates three-dimensional (3D) atomic models from the amino acid sequences, and then it does multiple threading alignments and iterative structural assembly simulations. Hence, the function of the protein could be inferred by structurally matching the 3D models with other known proteins. The output from the webserver run was important because it contained the information of full-length secondary and tertiary structure predictions, and functional annotations on ligand-binding sites, enzyme commission numbers and Gene Onto-logy terms. The accuracy of the predictions was based on the C-score of the modeling which depicted the best equivalent residues of two proteins based on the structural similarity and the output of TM-score.</p>
			</sec>
			<sec id="S20014">
				<title>Search for the binding sites</title>
				<p>Our search for the binding sites in the modelled structure was done by using I-TASSER server with which we could locate the exact positions of various amino acid residues at their respective binding sites. We made sure that these residues were in the binding pocket within the vicinity of the active site of the modelled protein, as shown in the results below:</p>
				<p>PHE:14 ASN:15 ASP:16 PRO:17 GLU:18 VAL:20 VAL:146 ASP:147 VAL:149 PRO: 331 PRO:332 CYS:333 TYR:334 HIS:342 ARG:343 LEU:344 GLU:345 SER:346 TYR:366 GLU:369 PHE:370 MET:373 PRO:374 PRO:376 TYR:378 THR:379</p>
				<p>The identification of putative ligand-binding sites on proteins is important for the prediction of protein function. Knowledge-based approaches using structure databases have become interesting, because of the recent increase in structural information (<xref ref-type="fig" rid="F0002">Figure 2</xref>).</p>
				<fig id="F0002">
					<label>Figure 2</label>
					<caption>
						<p>Showing the binding site view for I-TASSER model</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g002.tif" alt-version="no"/>
				</fig>
			</sec>
			<sec id="S20015">
				<title>Verification and validation of the model by PRO-CHECK, ramachandran plot, ERRAT value and verify-3D</title>
				<p>Verification of the built model was done to ensure whether the model was programmed correctly and the algorithms were implemented. Validation results determined that the distribution of amino acid residues were at the most favourable region in the Ramachandran plot (more than 90%). This is an indication of the stereochemical quality of the model taken for the structural analysis, it and also validated the target-ligand binding efficacy of the structure. Ramachandran plot displays the main chain torsion angles phi, psi (<italic>&#x03C6;</italic>, &#x03C8;); (Ramachandran angles) in a protein of known structure (<xref ref-type="fig" rid="F0003">Figure 3</xref>).</p>
				<fig id="F0003">
					<label>Figure 3</label>
					<caption>
						<p>Secondary structure and Ramachandran plot view of the model by modeler9v8</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g003.tif" alt-version="no"/>
				</fig>
				<p>Dihedral angle checks Ramachandran plot shows phi-psi distribution. Each residue is classified according to its region: &#x2018;core&#x2019;, &#x2018;allowed&#x2019;, &#x2018;generous&#x2019;, or &#x2018;disallowed&#x2019;. Residues in the generous and disallowed regions are high-lighted on the plot. A log-odds score shows how normal or unusual the residue&#x0027;s location is on the Ramachandran plot for the given residue type. Procheck results gave us the value of 96.3% residues in most favoured regions in R-Plot which suggests that they predict thioredoxin model of good quality (<xref ref-type="fig" rid="F0004">Figure 4A</xref>).</p>
				<fig id="F0004">
					<label>Figure 4</label>
					<caption>
						<p>Model evaluation by SAVES server. A) Ramachandran plot procheck, B) ERRAT Program, C) Verify_3D program</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g004.tif" alt-version="no"/>
				</fig>
				<p>ERRAT is another program which we used for verifying crystal structures. Error values in this program are plotted as a function of the position of a sliding residue in the window. The error function is based on the statistics of non bonded atom-atom interactions in the modeled structure. ERRAT prompts the models to have the overall quality factor to be above 95%. And our results have shown that the value of overall quality factor was 95.506%. This confirmed that our developed model had reliable high resolution and quality compared to a database structures considered for the study (<xref ref-type="fig" rid="F0004">Figure 4B</xref>).</p>
				<p>Further we analyzed the compatibility of our predicted (3D) model by utilizing Verify-3D program with its own amino acid sequence (1D). For each residue of the amino acid the scores of a sliding 21-residue window (from -10 to +10) were added and plotted. The returned 3D-1D profile showed 3D-1D score of above 0.2 and consisted of 92.33% of the residues in the predicted model (<xref ref-type="fig" rid="F0004">Figure 4C</xref>).</p>
			</sec>
			<sec id="S20016">
				<title>Statistical analysis</title>
				<p>Statistical analysis in Ramachandran plot compared well with the observed and expected distributions of experimental observables and provided powerful tools for the quality control of our protein structure. The distribution of backbone dihedral angles (&#x2018;Ramachandran plot&#x2019;) have often been used for such quality control, but without a firm statistical foundation. Hence the output for a protein structure is a Ramachandran Z-score, expressing the quality of the Ramachandran plot relative to current state-of-the-art structures.</p>
			</sec>
			<sec id="S20017">
				<title>Model optimization</title>
				<p>Loop optimization for generating our model was done by using the software Modeller 9V8 (<xref ref-type="bibr" rid="CIT0017">17</xref>). The model had initial potential energy=55550610.692 and initial RMS gradient=1115700.680. For energy minimization of our model we used GROMACS program. This was done by using steepest descent algorithm for 1000 steps, and 5000 steps for conjugate gradient algorithm we obtained its potential energy as-931094.12296 and RMS gradient as 0.63272. The models are presented in (<xref ref-type="fig" rid="F0005">Figures 5A</xref> and <xref ref-type="fig" rid="F0005">5B</xref>).</p>
				<fig id="F0005">
					<label>Figure 5</label>
					<caption>
						<p>Model generated after optimization using steepest descent (1000) and conjugate gradient(5000) algorithm using GROMACS</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g005.tif" alt-version="no"/>
				</fig>
			</sec>
			<sec id="S20018">
				<title>Results of docking studies</title>
				<p>We selected the ligands from PUBCHEM and ZINC data bases (almost 400), and by virtual screening using MERCURY and MARVIN VIEW we could shortlist 4 ligands as the best fit ligands. These are listed below and are presented in <xref ref-type="table" rid="T0001">table 1</xref>.
			</p>
			<p><list list-type="roman-lower">
						<list-item>
							<p>3-hydroxy-2,3-diphenylbutanoic acid,</p>
						</list-item>
						<list-item>
							<p>4-amino-3-pentadecylphenol,</p>
						</list-item>
						<list-item>
							<p>3-(hydroxyimino)-2,4-diphenylbutanenitrile and</p>
						</list-item>
						<list-item>
							<p>2-ethyl-1,2-diphenylbutyl carbamate Using these four ligands Docking studies were performed using GOLD to evaluate the best docked ligand (<xref ref-type="fig" rid="F0006">Figure 6</xref>).</p>
						</list-item>
					</list>
					</p>
				<fig id="F0006">
					<label>Figure 6</label>
					<caption>
						<p>After docking the similar ligands, totally four ligands were shown to bind with gold score greater than 50. All the four ligands, were docked to minimized structure using GOLD and the best ligand with top scores interaction is shown below</p>
					</caption>
					<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-g006.tif" alt-version="no"/>
				</fig>
				<table-wrap id="T0001">
				<label>Table 1</label>
					<caption>
						<p>Molecule description</p>
					</caption>
					<table frame="hsides" rules="groups">
					<tbody>
							<tr>
								<td align="left"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-ilf1.tif"/></td>
							<td align="left"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-ilf2.tif"/></td>
							</tr>
							<tr>
								<td align="left">
									<bold>3-hydroxy-2,3-diphenylbutanoic acid</bold>
									<break/>Source: PUBCHEM<break/>Molecular Weight: 256.29644 [<italic>g/mol</italic>]<break/>Molecular Formula: C16H16O3<break/>XLogP3-AA: 3.1<break/>H-Bond Donor: 2<break/>H-Bond Acceptor: 3<break/>Rotatable Bond Count: 4<break/>Exact Mass: 256.109944<break/>MonoIsotopic Mass: 256.109944<break/>Topological Polar Surface Area: 57.5<break/>Heavy Atom Count: 19</td>
								<td align="left">
									<bold>4-amino-3-pentadecylphenol</bold>
									<break/>Source: PUBCHEM<break/>Molecular Weight: 319.52458 [<italic>g/mol</italic>]<break/>Molecular Formula: C21H37NO<break/>XLogP3-AA8: .7<break/>H-Bond Donor: 2<break/>H-Bond Acceptor: 2<break/>Rotatable Bond Count: 14<break/>Tautomer Count: 13<break/>Exact Mass: 319.287515<break/>MonoIsotopic Mass: 319.287515<break/>Topological Polar Surface Area: 46.2<break/>Heavy Atom Count: 23</td>
							</tr>
							<tr>
								<td align="left"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-ilf3.tif"/></td>
							<td align="left"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="AJMB-4-121-ilf4.tif"/></td>
							</tr>
							<tr>
								<td align="left">
									<bold>3-(hydroxyimino)-2,4-diphenylbutanenitrile</bold>
									<break/>Source: PUBCHEM<break/>Molecular Weight: 250.29516 [<italic>g/mol</italic>]<break/>Molecular Formula: C16H14N2O<break/>XLogP3-AA: 3.4<break/>H-Bond Donor: 1<break/>H-Bond Acceptor: 3<break/>Rotatable Bond Count: 4<break/>Tautomer Count: 2<break/>Exact Mass: 250.110613<break/>MonoIsotopic Mass: 250.110613<break/>Topological Polar Surface Area: 56.4<break/>Heavy Atom Count: 19</td>
								<td align="left">
									<bold>2-ethyl-1,2-diphenylbutyl carbamate</bold>
									<break/>Source: PUBCHEM<break/>Molecular Weight: 297.39142 [<italic>g/mol</italic>]<break/>Molecular Formula: C19H23NO2<break/>XLogP3-AA: 4.6<break/>H-Bond Donor: 1<break/>H-Bond Acceptor: 2<break/>Rotatable Bond Count: 7<break/>Tautomer Count: 2<break/>Exact Mass: 297.172879<break/>MonoIsotopic Mass: 297.172879<break/>Topological Polar Surface Area: 52.3<break/>Heavy Atom Count: 22</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
				<p>Since the results depend on the choice of scoring functions obtained in GOLD (<xref ref-type="table" rid="T0002">Table 2</xref>), an analysis was performed based on the ligand binding score greater than 50. Also, there was no overlap between the top-scoring compounds from protein-ligand versus ligand-based scoring (<xref ref-type="table" rid="T0002">Table 2</xref>). The small molecules which we determined involved compounds with similar chemical structures, similar modes of action, or drug interactions.
</p>
				<table-wrap id="T0002">
				<label>Table 2</label>
					<caption>
						<p>Chemscore based interactions of molecules docked into the active site of thioredoxin</p>
					</caption>
					<table frame="hsides" rules="groups">
						<tbody>
							<tr>
								<td align="left"><bold>S.No</bold></td>
								<td align="center"><bold>Score value and molecule description</bold></td>
							</tr>
							<tr>
								<td align="left">
									<bold>1</bold>
								</td>
								<td align="center">30.36</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">8.26</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">19.90</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">-5.26</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">26.999 3-hydroxy-2,3-diphenylbutanoic acid</td>
							</tr>
							<tr>
								<td align="left">
									<bold>2</bold>
								</td>
								<td align="center">20.54</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">32.29</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">-23.86</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">246.000 4-amino-3-pentadecylphenol</td>
							</tr>
							<tr>
								<td align="left">
									<bold>3</bold>
								</td>
								<td align="center">32.28</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.27</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">24.61</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">-1.83</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">72.000 3-(hydroxyimino)-2,4-diphenylbutanenitrile</td>
							</tr>
							<tr>
								<td align="left">
									<bold>4</bold>
								</td>
								<td align="center">-6.17</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">18.39</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">0.00</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">-31.46</td>
							</tr>
							<tr>
								<td align="left"/>
								<td align="center">41.000 2-ethyl-1,2-diphenylbutyl carbamate</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</sec>
		</sec>
		<sec id="S0019" sec-type="discussion">
			<title>Discussion</title>
			<p>The biological activity of the all four best fit predicted molecules is very poorly documented in several databases including &#x2018;The Bibra Toxicity Profiles&#x2019; which documents critical reviews on the most pertinent toxicological data published on commercially important chemicals. Also, FDA and FDA poisonous plant database did not list these compounds. Only one compound has been listed in pubchem as anticancer drug (3-hydroxy-2,3-diphenylbutanoicacid). The molecule 3-hydroxy-2,3-diphenyl butanoic acid- was shown as anticancer drug <italic>in vivo</italic> model, [NCI] data in mice tumor model L1210 Leukemia (intraperitoneal) in B6D2F1 (BDF1) mice.</p>
		</sec>
		<sec id="S0020" sec-type="conclusion">
			<title>Conclusion</title>
			<p>However, our studies on these molecules showed these compounds as good candidates for anticancer activities. Therefore, our approach is valuable for drug discovery process and cancer therapy. Hence, now there is a need to study the pharmacological activity of these compounds in mice or <italic>in vivo</italic> models.</p>
		</sec>
	</body>
	<back>
		<ref-list>
			<title>References</title>
			<ref id="CIT0001">
				<label>1</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Nishiyama</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Matsui</surname>
							<given-names>M</given-names>
						</name>
						<name>
							<surname>Iwata</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Hirota</surname>
							<given-names>K</given-names>
						</name>
						<name>
							<surname>Masutani</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Nakamura</surname>
							<given-names>H</given-names>
						</name>
						<etal/>
					</person-group>
					<article-title>Identification of thioredoxin-binding protein-2/vitamin D(3) up-regulated protein 1 as a negative regulator of thioredoxin function and expression</article-title>
					<source>J Biol Chem</source>
					<year>1999</year>
					<volume>274</volume>
					<issue>31</issue>
					<fpage>21645</fpage>
					<lpage>21650</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0002">
				<label>2</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Berndt</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>Lillig</surname>
							<given-names>CH</given-names>
						</name>
						<name>
							<surname>Holmgren</surname>
							<given-names>A</given-names>
						</name>
					</person-group>
					<article-title>Thiol-based mechanisms of the thioredoxin and glutaredoxin systems: implications for diseases in the cardiovascular system</article-title>
					<source>Am J Physiol Heart Circ Physiol</source>
					<year>2007</year>
					<volume>292</volume>
					<issue>3</issue>
					<fpage>H1227</fpage>
					<lpage>H1236</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0003">
				<label>3</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Marks</surname>
							<given-names>PA</given-names>
						</name>
					</person-group>
					<article-title>Thioredoxin in cancer-role of histone deacetylase inhibitors</article-title>
					<source>Semin Cancer Biol</source>
					<year>2006</year>
					<volume>16</volume>
					<issue>6</issue>
					<fpage>436</fpage>
					<lpage>443</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0004">
				<label>4</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Nalini</surname>
							<given-names>R</given-names>
						</name>
						<name>
							<surname>Wilma Delphine Silvia</surname>
							<given-names>CR</given-names>
						</name>
						<name>
							<surname>Makhija</surname>
							<given-names>PM</given-names>
						</name>
						<name>
							<surname>Uthappa</surname>
							<given-names>S</given-names>
						</name>
					</person-group>
					<article-title>Usefulness of serum ca 15.3 and histo-pathological prognostic indices in breast cancer</article-title>
					<source>Indian J Clin Biochem</source>
					<year>2005</year>
					<volume>20</volume>
					<issue>1</issue>
					<fpage>165</fpage>
					<lpage>168</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0005">
				<label>5</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Masutani</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Ueda</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Yodoi</surname>
							<given-names>J</given-names>
						</name>
					</person-group>
					<article-title>The thioredoxin system in retroviral infection and apoptosis</article-title>
					<source>Cell Death Differ</source>
					<year>2005</year>
					<volume>12</volume>
					<supplement>Suppl 1</supplement>
					<fpage>991</fpage>
					<lpage>998</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0006">
				<label>6</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Karlenius</surname>
							<given-names>TC</given-names>
						</name>
						<name>
							<surname>Tonissen</surname>
							<given-names>KF</given-names>
						</name>
					</person-group>
					<article-title>Thioredoxin and cancer: A role for Thioredoxin in all states of tumor oxygenation</article-title>
					<source>Cancers</source>
					<year>2010</year>
					<volume>2</volume>
					<issue>2</issue>
					<fpage>209</fpage>
					<lpage>232</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0007">
				<label>7</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Munemasa</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Ahn</surname>
							<given-names>JH</given-names>
						</name>
						<name>
							<surname>Kwong</surname>
							<given-names>JM</given-names>
						</name>
						<name>
							<surname>Caprioli</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Piri</surname>
							<given-names>N</given-names>
						</name>
					</person-group>
					<article-title>Redox proteins thioredoxin 1 and thioredoxin 2 support retinal ganglion cell survival in experimental glaucoma</article-title>
					<source>Gene Ther</source>
					<year>2009</year>
					<volume>16</volume>
					<issue>1</issue>
					<fpage>17</fpage>
					<lpage>25</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0008">
				<label>8</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Zhang</surname>
							<given-names>Y</given-names>
						</name>
					</person-group>
					<article-title>I-TASSER server for protein 3D structure prediction</article-title>
					<source>BMC Bioinformatics</source>
					<year>2008</year>
					<volume>9</volume>
					<fpage>40</fpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0009">
				<label>9</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kotra</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Madala</surname>
							<given-names>KK</given-names>
						</name>
						<name>
							<surname>Jamil</surname>
							<given-names>K</given-names>
						</name>
					</person-group>
					<article-title>Homology models of the mutated EGFR and their response towards quinazolin analogues</article-title>
					<source>J Mol Graph Model</source>
					<year>2008</year>
					<volume>27</volume>
					<issue>3</issue>
					<fpage>244</fpage>
					<lpage>254</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0010">
				<label>10</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hess</surname>
							<given-names>B</given-names>
						</name>
						<name>
							<surname>Kutzner</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>van der Spoel</surname>
							<given-names>D</given-names>
						</name>
						<name>
							<surname>Lindahl</surname>
							<given-names>E</given-names>
						</name>
					</person-group>
					<article-title>GROMACS 4: algorithms for highly efficient, load-balanced and scalable molecular simulation</article-title>
					<source>J Chem Theory Comput</source>
					<year>2008</year>
					<volume>4</volume>
					<issue>3</issue>
					<fpage>435</fpage>
					<lpage>447</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0011">
				<label>11</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Shoichet</surname>
							<given-names>BK</given-names>
						</name>
					</person-group>
					<article-title>Virtual screening of chemical libraries</article-title>
					<source>Nature</source>
					<year>2004</year>
					<volume>432</volume>
					<fpage>862</fpage>
					<lpage>865</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0012">
				<label>12</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kitchen</surname>
							<given-names>DB</given-names>
						</name>
						<name>
							<surname>Decornez</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Furr</surname>
							<given-names>JR</given-names>
						</name>
						<name>
							<surname>Bajorath</surname>
							<given-names>J</given-names>
						</name>
					</person-group>
					<article-title>Docking and scoring in virtual screening for drug discovery: methods and applications</article-title>
					<source>Nat Rev Drug Discov</source>
					<year>2004</year>
					<volume>3</volume>
					<issue>11</issue>
					<fpage>935</fpage>
					<lpage>949</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0013">
				<label>13</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wang</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Xiao</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Suzek</surname>
							<given-names>TO</given-names>
						</name>
						<name>
							<surname>Zhang</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Bryant</surname>
							<given-names>SH</given-names>
						</name>
					</person-group>
					<article-title>PubChem: a public information system for analyzing bioactivities of small molecules</article-title>
					<source>Nucleic Acids Res</source>
					<year>2009</year>
					<volume>37</volume>
					<issue>Web Server issue</issue>
					<fpage>W623</fpage>
					<lpage>33</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0014">
				<label>14</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Sudha</surname>
							<given-names>KN</given-names>
						</name>
						<name>
							<surname>Shakira</surname>
							<given-names>M</given-names>
						</name>
						<name>
							<surname>Prasanthi</surname>
							<given-names>P</given-names>
						</name>
						<name>
							<surname>Sarika</surname>
							<given-names>N</given-names>
						</name>
						<name>
							<surname>Kumar</surname>
							<given-names>ChN</given-names>
						</name>
						<name>
							<surname>Babu</surname>
							<given-names>PA</given-names>
						</name>
					</person-group>
					<article-title>Virtual screening for novel COX-2 inhibitors using the ZINC database</article-title>
					<source>Bioinformation</source>
					<year>2008</year>
					<volume>2</volume>
					<issue>8</issue>
					<fpage>325</fpage>
					<lpage>329</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0015">
				<label>15</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Verdonk</surname>
							<given-names>ML</given-names>
						</name>
						<name>
							<surname>Chessari</surname>
							<given-names>G</given-names>
						</name>
						<name>
							<surname>Cole</surname>
							<given-names>JC</given-names>
						</name>
						<name>
							<surname>Hartshorn</surname>
							<given-names>MJ</given-names>
						</name>
						<name>
							<surname>Murray</surname>
							<given-names>CW</given-names>
						</name>
						<name>
							<surname>Nissink</surname>
							<given-names>JW</given-names>
						</name>
						<etal/>
					</person-group>
					<article-title>Modeling water molecules in protein-ligand docking using GOLD</article-title>
					<source>J Med Chem</source>
					<year>2005</year>
					<volume>48</volume>
					<issue>20</issue>
					<fpage>6504</fpage>
					<lpage>6515</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0016">
				<label>16</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Verdonk</surname>
							<given-names>ML</given-names>
						</name>
						<name>
							<surname>Cole</surname>
							<given-names>JC</given-names>
						</name>
						<name>
							<surname>Hartshorn</surname>
							<given-names>MJ</given-names>
						</name>
						<name>
							<surname>Murray</surname>
							<given-names>CW</given-names>
						</name>
						<name>
							<surname>Taylor</surname>
							<given-names>RD</given-names>
						</name>
					</person-group>
					<article-title>Improved protein-ligand docking using GOLD</article-title>
					<source>Proteins</source>
					<year>2003</year>
					<volume>52</volume>
					<issue>4</issue>
					<fpage>609</fpage>
					<lpage>623</lpage>
				</nlm-citation>
			</ref>
			<ref id="CIT0017">
				<label>17</label>
				<nlm-citation citation-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Eswar</surname>
							<given-names>N</given-names>
						</name>
						<name>
							<surname>Webb</surname>
							<given-names>B</given-names>
						</name>
						<name>
							<surname>Marti-Renom</surname>
							<given-names>MA</given-names>
						</name>
						<name>
							<surname>Madhusudhan</surname>
							<given-names>MS</given-names>
						</name>
						<name>
							<surname>Eramian</surname>
							<given-names>D</given-names>
						</name>
						<name>
							<surname>Shen</surname>
							<given-names>MY</given-names>
						</name>
						<etal/>
					</person-group>
					<article-title>Comparative protein structure modeling using MODELLER</article-title>
					<source>Curr Protoc Protein Sci</source>
					<year>2007</year>
					<comment>Chapter 2:Unit 2.9</comment>
					</nlm-citation>
			</ref>
		</ref-list>
	</back>
</article>
