<?xml version="1.0" encoding="utf-8"?>
<journal>
<title>Basic and Clinical Neuroscience Journal</title>
<title_fa>مجله علوم اعصاب پایه و بالینی</title_fa>
<short_title>BCN</short_title>
<subject>Medical Sciences</subject>
<web_url>http://bcn.iums.ac.ir</web_url>
<journal_hbi_system_id>137</journal_hbi_system_id>
<journal_hbi_system_user>journal137</journal_hbi_system_user>
<journal_id_issn>2008-126X</journal_id_issn>
<journal_id_issn_online>2228-7442</journal_id_issn_online>
<journal_id_pii></journal_id_pii>
<journal_id_doi>10.32598/bcn</journal_id_doi>
<journal_id_iranmedex></journal_id_iranmedex>
<journal_id_magiran></journal_id_magiran>
<journal_id_sid></journal_id_sid>
<journal_id_nlai></journal_id_nlai>
<journal_id_science></journal_id_science>
<language>en</language>
<pubdate>
	<type>jalali</type>
	<year>1404</year>
	<month>6</month>
	<day>1</day>
</pubdate>
<pubdate>
	<type>gregorian</type>
	<year>2025</year>
	<month>9</month>
	<day>1</day>
</pubdate>
<volume>16</volume>
<number>5</number>
<publish_type>online</publish_type>
<publish_edition>1</publish_edition>
<article_type>fulltext</article_type>
<articleset>
	<article>


	<language>en</language>
	<article_id_doi></article_id_doi>
	<title_fa></title_fa>
	<title>Better than Maximum Likelihood Estimation of Model-based and Model-free Learning Styles</title>
	<subject_fa>Computational Neuroscience</subject_fa>
	<subject>Computational Neuroscience</subject>
	<content_type_fa>Original</content_type_fa>
	<content_type>Original</content_type>
	<abstract_fa></abstract_fa>
	<abstract>&lt;div style=&quot;text-align: justify;&quot;&gt;&lt;strong&gt;Introduction&lt;/strong&gt;: Various decision-making systems collaborate to shape human behavior. Reinforcement learning (RL) studies the two primary ones, the goal-directed and habitual systems, through model-based (MB) and model-free (MF) learning styles, respectively. Human behavior can be viewed as a combination of these two decision-making paradigms, modeled within an RL framework as a weighted sum of their action values. The weighting parameter is usually estimated with the maximum likelihood (ML) or maximum a posteriori (MAP) method.&amp;nbsp;&lt;br&gt;
&lt;strong&gt;Methods&lt;/strong&gt;: In this study, we employ RL agents that combine MB and MF decision-making to perform the well-known Daw two-stage task. For these agents, ML and MAP methods yield unreliable estimates of the weighting parameter, often with a large bias toward extreme values. We propose a k-nearest neighbor estimator as a nonparametric alternative that reduces the estimation error, using a set of 20 features extracted from the behavior of the RL agent. Simulated experiments evaluate the proposed method.&amp;nbsp;&lt;br&gt;
&lt;strong&gt;Results&lt;/strong&gt;: The results show that the proposed method reduces the bias and variance of the estimation error. We also apply it to human behavior data from previous studies, where it enables the prediction of indices such as age, gender, IQ, dwell time of gaze, and psychiatric disorder indices, which the traditional methods do not capture.&amp;nbsp;&lt;br&gt;
&lt;strong&gt;Conclusion&lt;/strong&gt;: In brief, the proposed method increases the reliability of the estimated parameters and enhances the applicability of RL paradigms in clinical trials.&lt;/div&gt;</abstract>
	<keyword_fa></keyword_fa>
	<keyword>Model-based (MB) and model-free (MF) combined learning, Modeling different styles of learning, k-Nearest neighbors, Maximum likelihood (ML), Maximum a posteriori (MAP), Behavioral observation analysis, Behavioral parameter estimation</keyword>
	<start_page>891</start_page>
	<end_page>912</end_page>
	<web_url>http://bcn.iums.ac.ir/browse.php?a_code=A-10-5883-1&amp;slc_lang=en&amp;sid=1</web_url>


<author_list>
	<author>
	<first_name>Sadjad</first_name>
	<middle_name></middle_name>
	<last_name>Yazdani</last_name>
	<suffix></suffix>
	<first_name_fa></first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa></last_name_fa>
	<suffix_fa></suffix_fa>
	<email>sadjad.yazdani@gmail.com</email>
	<code>13700319475328460055658</code>
	<orcid>13700319475328460055658</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>Department of Machine Intelligence and Robotics, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.</affiliation>
	<affiliation_fa></affiliation_fa>
	 </author>


	<author>
	<first_name>Abdol-Hossein</first_name>
	<middle_name></middle_name>
	<last_name>Vahabie</last_name>
	<suffix></suffix>
	<first_name_fa></first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa></last_name_fa>
	<suffix_fa></suffix_fa>
	<email>h.vahabie@ut.ac.ir</email>
	<code>13700319475328460055659</code>
	<orcid>13700319475328460055659</orcid>
	<coreauthor>Yes</coreauthor>
	<affiliation>Department of Machine Intelligence and Robotics, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.</affiliation>
	<affiliation_fa></affiliation_fa>
	 </author>


	<author>
	<first_name>Babak</first_name>
	<middle_name></middle_name>
	<last_name>Nadjar-Araabi</last_name>
	<suffix></suffix>
	<first_name_fa></first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa></last_name_fa>
	<suffix_fa></suffix_fa>
	<email>araabi@ut.ac.ir</email>
	<code>13700319475328460055660</code>
	<orcid>13700319475328460055660</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>Department of Machine Intelligence and Robotics, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.</affiliation>
	<affiliation_fa></affiliation_fa>
	 </author>


	<author>
	<first_name>Majid</first_name>
	<middle_name></middle_name>
	<last_name>Nili Ahmadabadi</last_name>
	<suffix></suffix>
	<first_name_fa></first_name_fa>
	<middle_name_fa></middle_name_fa>
	<last_name_fa></last_name_fa>
	<suffix_fa></suffix_fa>
	<email>mnili@ut.ac.ir</email>
	<code>13700319475328460055661</code>
	<orcid>13700319475328460055661</orcid>
	<coreauthor>No</coreauthor>
	<affiliation>Department of Machine Intelligence and Robotics, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.</affiliation>
	<affiliation_fa></affiliation_fa>
	 </author>


</author_list>


	</article>
</articleset>
</journal>
