{"id":9302,"date":"2023-03-14T05:26:56","date_gmt":"2023-03-14T05:26:56","guid":{"rendered":"https:\/\/www.goodacademic.com\/blog\/questions\/please-look-into-the-attached-file-or-feel-free-to-ask-if-any-information-required\/"},"modified":"2023-03-14T05:26:56","modified_gmt":"2023-03-14T05:26:56","slug":"please-look-into-the-attached-file-or-feel-free-to-ask-if-any-information-required","status":"publish","type":"questions","link":"https:\/\/www.goodacademic.com\/blog\/questions\/please-look-into-the-attached-file-or-feel-free-to-ask-if-any-information-required\/","title":{"rendered":"Please look into the attached file. Or feel free to ask if any information required."},"content":{"rendered":"<div class=\"col-sm-12 messageContent\">\n<ol>\n<li>Obtain and preprocess the dataset: Obtain the MIMIC-III dataset and preprocess it to extract the relevant information. This includes extracting the note ID, chart date, note text, hospital expire flag, and all diagnosis ICD-9 codes associated with the note via admission ID. You can use Python or any other programming language to extract this information from the dataset.<\/li>\n<li>Index the notes with Solr: Use Solr to index the notes using the information extracted in step 1. Use the note ID (noteevents.row_id) as the Solr Document ID.<\/li>\n<li>Build a user interface: Develop a user interface that allows users to enter query conditions and returns a list of matching notes. You can build a web UI or an interactive command-line interface.<\/li>\n<li>Allow Lucene Query Syntax: Allow users to enter queries using Lucene Query Syntax. 
This will enable them to search within one or a combination of all the required information in step 1(a) without knowing the field names in the Lucene index.<\/li>\n<li>Use query expansion for synonyms: Implement a function to expand queries with synonyms. Use the Consumer Health Vocabulary (CHV) in UMLS to get all English synonyms of the input term from the user. Limit the synonyms to 30 terms for better performance.<\/li>\n<li>Enable user control over query expansion: Allow users to control whether or not to use query expansion in the final query condition.<\/li>\n<li>Evaluate the system: Run the system against the query conditions mentioned in the question to evaluate its performance.<\/li>\n<\/ol>\n<div class=\"questions-requirements\">\n<p class=\"requirement\">Requirements: As required in the attached file <span style=\"font-family: sans-serif; color: #747474\"> &nbsp; | &nbsp; <\/span> .doc file | Python<\/p>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Obtain and preprocess the dataset: Obtain the MIMIC-III dataset and preprocess it to extract the relevant information. This includes extracting the note ID, chart date, note text, hospital expire flag, and all diagnosis ICD-9 codes associated with the note via admission ID. You can use Python or any other programming language to extract this information from the dataset. 
Index [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"closed","template":"","meta":[],"disciplines":[721],"paper_types":[],"tagged":[],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions\/9302"}],"collection":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions"}],"about":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/types\/questions"}],"author":[{"embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/comments?post=9302"}],"version-history":[{"count":0,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/questions\/9302\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/media?parent=9302"}],"wp:term":[{"taxonomy":"disciplines","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/disciplines?post=9302"},{"taxonomy":"paper_types","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/paper_types?post=9302"},{"taxonomy":"tagged","embeddable":true,"href":"https:\/\/www.goodacademic.com\/blog\/wp-json\/wp\/v2\/tagged?post=9302"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}