Handling Non-English Clinical Trials Data in SDTM Conversions for US-FDA Submissions Ramanan D. Bathala, TAKE Solutions Limited, Chennai, INDIA

Similar documents
An Alternate Way to Create the Standard SDTM Domains

CDISC Variable Mapping and Control Terminology Implementation Made Easy

Unofficial Redmine Cooking - QA #782 yaml_db を使った DB のマイグレーションで失敗する

Submission-Ready Define.xml Files Using SAS Clinical Data Integration Melissa R. Martinez, SAS Institute, Cary, NC USA

Study Composer: a CRF design tool enabling the re-use of CDISC define.xml metadata

Analysis on the Multi-stakeholder Structure in the IGF Discussions

Software Prototyping ( プロトタイピング ) Animating and demonstrating system requirements


PharmaSUG Paper AD03

Nonfinancial Reporting Track:03 Providing non-financial information to reporters, analysts and asset managers; the EDINET Case

PharmaSUG 2014 PO16. Category CDASH SDTM ADaM. Submission in standardized tabular form. Structure Flexible Rigid Flexible * No Yes Yes

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

How to handle different versions of SDTM & DEFINE generation in a Single Study?

Legacy to SDTM Conversion Workshop: Tools and Techniques

The Wonderful World of Define.xml.. Practical Uses Today. Mark Wheeldon, CEO, Formedix DC User Group, Washington, 9 th December 2008

Googleの強みは ささえるのは世界一のインフラ. Google File System 2008年度後期 情報システム構成論2 第10回 クラウドと協調フィルタリング. 初期(1999年)の Googleクラスタ. 最近のデータセンタ Google Chrome Comicより

Automate Clinical Trial Data Issue Checking and Tracking

Best Practices for E2E DB build process and Efficiency on CDASH to SDTM data Tao Yang, FMD K&L, Nanjing, China

How to write ADaM specifications like a ninja.

Preparing the Office of Scientific Investigations (OSI) Requests for Submissions to FDA

Certificate of Accreditation

Codelists Here, Versions There, Controlled Terminology Everywhere Shelley Dunn, Regulus Therapeutics, San Diego, California

Edwin Ponraj Thangarajan, PRA Health Sciences, Chennai, India Giri Balasubramanian, PRA Health Sciences, Chennai, India

PharmaSUG Paper PO10

SoftBank 101SI Upgrade tool for Windows PC Instruction Manual

Methods to Detect Malicious MS Document File using File Structure Inspection

API サーバの URL. <?xml version="1.0" encoding="utf-8"?> <!DOCTYPE COMPLIANCE_SCAN SYSTEM "

The Judgment of Document Similarities Orthogonal Transformations and Improvement of the property

J の Lab システムの舞台裏 - パワーポイントはいらない -

携帯電話の 吸収率 (SAR) について / Specific Absorption Rate (SAR) of Mobile Phones

Creating a Patient Profile using CDISC SDTM Marc Desgrousilliers, Clinovo, Sunnyvale, CA Romain Miralles, Clinovo, Sunnyvale, CA

携帯電話の 吸収率 (SAR) について / Specific Absorption Rate (SAR) of Mobile Phones

PharmaSUG Paper PO21

Pharmaceuticals, Health Care, and Life Sciences. An Approach to CDISC SDTM Implementation for Clinical Trials Data

The Benefits of Traceability Beyond Just From SDTM to ADaM in CDISC Standards Maggie Ci Jiang, Teva Pharmaceuticals, Great Valley, PA

PhUSE Paper SD09. "Overnight" Conversion to SDTM Datasets Ready for SDTM Submission Niels Mathiesen, mathiesen & mathiesen, Basel, Switzerland

Preparing Information Design-Oriented. Posters. easy to. easy to. See! Understand! easy to. Convey!

SDTM Attribute Checking Tool Ellen Xiao, Merck & Co., Inc., Rahway, NJ

Tips on Creating a Strategy for a CDISC Submission Rajkumar Sharma, Nektar Therapeutics, San Francisco, CA

Converting Data to the SDTM Standard Using SAS Data Integration Studio

Planning to Pool SDTM by Creating and Maintaining a Sponsor-Specific Controlled Terminology Database

Sandra Minjoe, Accenture Life Sciences John Brega, PharmaStat. PharmaSUG Single Day Event San Francisco Bay Area

Yamaha Steinberg USB Driver V for Mac Release Notes

アルゴリズムの設計と解析 (W4022) 教授 : 黄潤和 広野史明 (A4/A10)

PharmaSUG Paper CC02

PharmaSUG. companies. This paper. will cover how. processes, a fairly linear. before moving. be carried out. Lifecycle. established.

esubmission - Are you really Compliant?

How a Metadata Repository enables dynamism and automation in SDTM-like dataset generation

Why organizations need MDR system to manage clinical metadata?

Experience of electronic data submission via Gateway to PMDA

Certificate of Accreditation

本書について... 7 本文中の表記について... 7 マークについて... 7 MTCE をインストールする前に... 7 ご注意... 7 推奨 PC 仕様... 8 MTCE をインストールする... 9 MTCE をアンインストールする... 11

A Taste of SDTM in Real Time

BMW Head Up Display (HUD) Teardown BMW ヘッドアップディスプレイティアダウン

Programming checks: Reviewing the overall quality of the deliverables without parallel programming

PhUSE US Connect 2019

Streamline SDTM Development and QC

JASCO-HPLC Operating Manual. (Analytical HPLC)

An Efficient Solution to Efficacy ADaM Design and Implementation

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

Online Meetings with Zoom

4M Change Notice NTC Lead type for Temperature Sensor : NXR Series

暗い Lena トーンマッピング とは? 明るい Lena. 元の Lena. tone mapped. image. original. image. tone mapped. tone mapped image. image. original image. original.

Application Interface for executing a batch of SAS Programs and Checking Logs Sneha Sarmukadam, inventiv Health Clinical, Pune, India

Comparison of FDA and PMDA Requirements for Electronic Submission of Study Data

振込依頼書記入要領 Entry Guide for Direct Deposit Request Form

MySQL Cluster 7.3 リリース記念!! 5 分で作る MySQL Cluster 環境

APRIN e-learning Program (CITI Japan) User s Manual

Scrambling of Un-Blinded Data without Scrambling Data Integrity! Jaya Baviskar, Pharmanet/i3, Mumbai, India

JPShiKen.COM 全日本最新の IT 認定試験問題集. 1 年で無料進級することに提供する

Interdomain Routing Security Workshop 21 BGP, 4 Bytes AS. Brocade Communications Systems, K.K.

Kazunari Okada( 岡田一成 ) Sr. Technical Marketing Manager ISO Vibration Analyst (CAT II) National Instruments Corporation Japan

Improving Metadata Compliance and Assessing Quality Metrics with a Standards Library

Figure 1. Table shell

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ

Automation of makefile For Use in Clinical Development Nalin Tikoo, BioMarin Pharmaceutical Inc., Novato, CA

IRS16: 4 byte ASN. Version: 1.0 Date: April 22, 2008 Cisco Systems 2008 Cisco, Inc. All rights reserved. Cisco Systems Japan

Energy Conservation & Productivity Enhancement Practices The Japan Perspective

Saki is a Japanese high school student who/ has just started to study/ in the US.//

Zabbix ログ解析方法. 2018/2/14 サイバートラスト株式会社 Linux/OSS 事業部技術統括部花島タケシ. Copyright Cybertrust Japan Co., Ltd. All rights reserved.

SAS (Statistical Analysis Software/System)

Yamaha Steinberg USB Driver V for Windows Release Notes

PharmaSUG China Paper 70

Infragistics ASP.NET リリースノート

Implementing CDISC Using SAS. Full book available for purchase here.

A. 展開図とそこから折れる凸立体の研究 1. 複数の箱が折れる共通の展開図 2 通りの箱が折れる共通の展開図 3 通りの箱が折れる共通の展開図そして. 残された未解決問題たち 2. 正多面体の共通の展開図 3. 正多面体に近い立体と正 4 面体の共通の展開図 ( 予備 )

SAS Programming Techniques for Manipulating Metadata on the Database Level Chris Speck, PAREXEL International, Durham, NC

サンプル. NI TestStand TM I: Introduction Course Manual

Advantages of a real end-to-end approach with CDISC standards

SAS Drug Development Program Portability

Cost-Benefit Analysis of Retrospective vs. Prospective Data Standardization

Studies of Large-Scale Data Visualization: EXTRAWING and Visual Data Mining

Paper William E Benjamin Jr, Owl Computer Consultancy, LLC

Paper FC02. SDTM, Plus or Minus. Barry R. Cohen, Octagon Research Solutions, Wayne, PA

4M Change Notice. NTC Lead type for Temperature Sensor : NXR Series

From Implementing CDISC Using SAS. Full book available for purchase here. About This Book... xi About The Authors... xvii Acknowledgments...

~ ソフトウエア認証への取り組みと課題 ~

JAMSTEC Tsutao Oizumi

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc.

Vehicle Calibration Techniques Established and Substantiated for Motorcycles

Transcription:

PharmaSUG2010 - Paper PO13 Handling Non-English Clinical Trials Data in SDTM Conversions for US-FDA Submissions Ramanan D. Bathala, TAKE Solutions Limited, Chennai, INDIA 1. ABSTRACT Converting Legacy clinical trial data in to CDISC-SDTM (Study Data Tabulations Model) for US-FDA submission purposes is challenge by itself. If these trials were conducted outside United States and that legacy data has non-english content, it poses additional challenges of translating them to English before converting to SDTM standard, and still maintaining the data integrity. This paper presents a case study on how this has been achieved for clinical trials data which had Japanese language contents. This paper also outlines the challenges faced and solutions adapted to address those challenges. Though this case study talks about how this has been done for clinical trial data in Japanese language, but the same concepts are applicable to clinical trials data collected in other languages as well. The focus of this paper is SAS options and techniques used to; view the non-english data, extract such non-english text from the data, insert the translated English text in respective observation and corresponding variables, cost effective use of translation services and finally how to maintain the data integrity during the entire process. 2. INTRODUCTION The players involved in this translation and SDTM conversion of collected (RAW) data were a CRO, Sponsor Company and a third party, language translation service provider. The problems which were to be handled are multidimensional including translating the protocol, CRF, CSR and most importantly; last but not least SAS data items in the submission. (SAS Data Items is a collective term used in this paper to represent both SAS datasets and SAS Format catalogs). The last part, Translating SAS Data Items with Japanese text into SAS Data Items in English is core topic and focus of this paper. The following flow diagram illustrates the workflow involved in translating SAS data items. Figure 1: Translation work-flow 1

3. CHALLENGES AND SOLUTIONS The following are few critical challenges we faced in translating SAS Data Items. 1. First challenge was finding out what needs to be done in SAS 9.1.3, such that one can see the Japanese contents in SAS data items. We used SAS NLS (National Language Support) options and more details on this are discussed in following sections. 2. How Japanese text data can be sent to the translation service provider, whose staff are not equipped with skills and tools to view SAS datasets, and how to insert the translated text in respective observations and corresponding variables. 3. How to extract the non-english text from SAS datasets using SAS 9.1.3. The techniques used in SAS are discussed in later sections. 4. How to minimize the use of the translation services, because typically language translation service providers charge the sponsor or CRO by the number of words/sentences/pages they translate. 5. How to maintain the data integrity during the entire process. 3.1 SAS NLS OPTIONS We have to enable DBCS support through the following options in SAS configuration file. More information on this subject can be found in page 14 of User Installation Guide available at http://support.sas.com/documentation/installcenter/the_sas_system/9.1.3_ts1m3/win/win/user.pdf Upon following the instructions the following NLS options should appear in the SAS configuration file. -DBCS -DBCSTYPE PCMS -DBCSLANG JAPANESE To view the Japanese text we should also install relevant fonts in the operating system by following the steps given below. 1. Click Start>Settings>Control Panel>Regional and Language Options. 2. Under Languages tab>supplemental language support, select the "Install files for east Asian languages" check box. 3. Under Advanced tab>language for Non-unicode programs, select Japanese 4. Click OK and reboot. 3.2 SENDING JAPANESE TEXT FOR TRANSLATION The other challenge is how these Japanese text strings can be sent for translation. The translators are not likely to have enough expertise to see and work with SAS tools for viewing datasets with Japanese text data. So, we have to extract them and put them into a some commonly used tools like excel workbooks or sheets so that they can view them and translate them and return it to the CRO/Sponsor organization. To extract the Japanese text strings from SAS Data Items a SAS program is written and executed for a given study. This SAS program reads every data item in that study and creates one excel workbook with following sheets in it. 1. Variables (All variables whose labels are in Japanese) 2. Values (All Japanese text values of all variables from all datasets) 3. Decodes (Format catalog entries whose values are in Japanese) 2

Variables sheet would have the following columns in it. The highlighted column (englabel) will be filled in by the dsname varnum varname label englabel AE 6 AE_NM 有害事象名 AE 7 AE_CD 有害事象コード AE 8 AE_DF 有害事象発現日 AE 9 SEVER 重症度 AE 10 SERIOUS 重篤性 Figure 2: Variables Sheet Values sheet would have the following columns in it. The highlighted column (engvalue) will be filled in by the dsname varname value engvalue AE ACTION 減量その後増量 AE AE_CMT 偶発事象である AE AE_CMT 偶発的なものであり治験薬との関連なし AE AE_CMT 偶発的な事象であり 治験薬との因果関係は無い AE AE_CMT 偶発的な事象である AE AE_CMT 偶発的な事象であると判断する AE AE_NM プロラクチン上昇 AE AE_NM 過鎮静 AE AE_NM 寒気 AE AE_NM 感冒 AE AE_NM 肝機能異常 AE AE_NM 疑惑 AE AE_NM 急性上気道炎 AE AE_NM 筋強剛 Figure 3: Values Sheet Decodes sheet will have the following columns in it. The highlighted column (engdecode) will be filled in by the fmtname code japdecode engdecode AAA001S 1 なし AAA001S 2 あり AAA002S 1 なし AAA002S 2 あり AAA002S 8 不明 Figure 4: Decodes Sheet 3.3 EXTRACTING JAPANESE TEXT The other challenge we faced is how to identify Japanese text in SAS Data and how those can be extracted from them. Because of the fact that Japanese characters are symbols and there are many of them it is almost impossible for us to as non-japanese professionals we could not come up with the finite number of Japanese characters to look for in the data. Rather it would be easier to identify come up with the finite list of non-japanese list of characters in data. Hence the following lists of possible non-japanese character strings are formed. 1. English Alphabet Characters(A to Z and a to z), which are stored in macro variable alpchar 2. Special Characters like (~!@#$%^&*(),./'"[ ]-=<>?:{}_+;), \µγ which are stored in macro variable splchar 3

3. Numeric Characters(0 to 9) which are stored in macro variable numchar 4. Miscellaneous double byte characters found in Values(.0123456789 ) which are stored in macro variable mischar With these char list is defined, we can use a SAS function KVERIFY to help us identify if the value has any other character other than the ones in these four strings. The KVERIFY function returns the position of the first character in source that is not present in any of the supplied arguments. If KVERIFY finds that every character in source is in at least one excerpt, it returns a 0. The kverify function used in the code as kverify(value, "&alpchar", "&splchar", "&numchar", "&mischar") > 0 This piece of code yields a true condition if any Japanese character is present in the value. 3.4 MINIMIZING THE TRANSLATION COST Since, typically language translation service providers charge the sponsor or CRO by the number of words/ sentences/ pages they translate, we have to minimize the translation cost by only sending the unique Japanese text strings for translation. We can easily do it for the values of variables. Since the variable values contain majority of the Japanese text strings if we can identify the unique strings here per variable per dataset, it would significantly reduce the translation cost. But, this comes with its own complexity of updating the English values for the corresponding Japanese text strings wherever they appear in a particular variable in that dataset. But this complexity was handled by writing a SAS macro code one time and the same was reused for many other studies. 3.5 MAINTAINING DATA INTEGRITY Maintaining the data integrity is most important in this process. We have to ensure that the rows and columns are not deleted in the excel sheets that were exported for translation. For this purpose we can use excel cell locking feature. Apart from locking we have to make sure that the QC process has check list with the following. 1. Number of rows and columns submitted in each excel sheet for translation. 2. Number of rows and columns received from translation service provider in each sheet should match the counts in item 1. 3. After the successful import of the sheets and English text replacement, we need to also make sure that the observations and variable counts are matching between the original datasets and the translated datasets. 4. We should also preserve the sort order during the entire process and make sure that the observations in translated datasets are retained in sequence as they were in the original datasets. 4. CONCLUSION Now that the datasets and format catalogs have all translated English text one can go ahead with the CDISC SDTM standardization of all the RAW datasets as per the process that CRO follows. 5. REFERENCES http://support.sas.com/documentation/installcenter/the_sas_system/9.1.3_ts1m3/win/win/user.pdf 6. CONTACT INFORMATION Your comments and questions are valued and encouraged. Contact the author at: Name : Ramanan D. Bathala Enterprise : TAKE Solutions Limited Address : 80/81 MBC Towers, 6 th Floor, TTK Road City, State ZIP : Chennai, INDIA 600 118. Work Phone : +91-44-6696 4306 Fax : +91-44-4216 9348 E-mail : ramanan.dayalan@takesolutions.com Web : www.takesolutions.com 4

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 5