Trouble parsing into metadata

Hi, I’m a total Clover n00b, first time post. I’m using Clover from within Eclipse.

INFO [main] - *** CloverETL framework/transformation graph, (c) 2002-2010 Javlin a.s, released under GNU Lesser General Public License ***
INFO [main] - Running with CloverETL library version 2.9.7 compiled 02/09/2010 11:56:15

I’m trying to create my metadata from the header row in my datafile. The datafile is a CSV with “” around each field (a sample is below). When I try “new metadata”->“extract from flat file” with encoding ISO-8859-1 and the delimited option, I just get a red Record:recordName1.

I’ve tried changing the record delimiter and field delimiters. I don’t know if it’s that the field names are too long, or if some field has a character that is unsupported. I’ve been playing with various incantations and can’t get it to work.

Below is a sample.

I’m hoping someone can suggest something.

Thank you,
Jay

“NPI”,“Entity_Type_Code”,“Replacement_NPI”,“Employer_Identification_Number__EIN_”,“Provider_Organization_Name__Legal_Business_Name_”,“Provider_Last_Name__Legal_Name_”,“Provider_First_Name”,“Provider_Middle_Name”,“Provider_Name_Prefix_Text”,“Provider_Name_Suffix_Text”,“Provider_Credential_Text”,“Provider_Other_Organization_Name”,“Provider_Other_Organization_Name_Type_Code”,“Provider_Other_Last_Name”,“Provider_Other_First_Name”,“Provider_Other_Middle_Name”,“Provider_Other_Name_Prefix_Text”,“Provider_Other_Name_Suffix_Text”,“Provider_Other_Credential_Text”,“Provider_Other_Last_Name_Type_Code”,“Provider_First_Line_Business_Mailing_Address”,“Provider_Second_Line_Business_Mailing_Address”,“Provider_Business_Mailing_Address_City_Name”,“Provider_Business_Mailing_Address_State_Name”,“Provider_Business_Mailing_Address_Postal_Code”,“Provider_Business_Mailing_Address_Country_Code__If_outside_U.S._”,“Provider_Business_Mailing_Address_Telephone_Number”,“Provider_Business_Mailing_Address_Fax_Number”,“Provider_First_Line_Business_Practice_Location_Address”,“Provider_Second_Line_Business_Practice_Location_Address”,“Provider_Business_Practice_Location_Address_City_Name”,“Provider_Business_Practice_Location_Address_State_Name”,“Provider_Business_Practice_Location_Address_Postal_Code”,“Provider_Business_Practice_Location_Address_Country_Code__If_outside_U.S._”,“Provider_Business_Practice_Location_Address_Telephone_Number”,“Provider_Business_Practice_Location_Address_Fax_Number”,“Provider_Enumeration_Date”,“Last_Update_Date”,“NPI_Deactivation_Reason_Code”,“NPI_Deactivation_Date”,“NPI_Reactivation_Date”,“Provider_Gender_Code”,“Authorized_Official_Last_Name”,“Authorized_Official_First_Name”,“Authorized_Official_Middle_Name”,“Authorized_Official_Title_or_Position”,“Authorized_Official_Telephone_Number”,“Healthcare_Provider_Taxonomy_Code_1”,“Provider_License_Number_1”,“Provider_License_Number_State_Code_1”,“Healthcare_Provider_Primary_Taxonomy_Switch_1”,“Healthcare_Provider_Taxonomy_Code_2”
,“Provider_License_Number_2”,“Provider_License_Number_State_Code_2”,“Healthcare_Provider_Primary_Taxonomy_Switch_2”,“Healthcare_Provider_Taxonomy_Code_3”,“Provider_License_Number_3”,“Provider_License_Number_State_Code_3”,“Healthcare_Provider_Primary_Taxonomy_Switch_3”,“Healthcare_Provider_Taxonomy_Code_4”,“Provider_License_Number_4”,“Provider_License_Number_State_Code_4”,“Healthcare_Provider_Primary_Taxonomy_Switch_4”,“Healthcare_Provider_Taxonomy_Code_5”,“Provider_License_Number_5”,“Provider_License_Number_State_Code_5”,“Healthcare_Provider_Primary_Taxonomy_Switch_5”,“Healthcare_Provider_Taxonomy_Code_6”,“Provider_License_Number_6”,“Provider_License_Number_State_Code_6”,“Healthcare_Provider_Primary_Taxonomy_Switch_6”,“Healthcare_Provider_Taxonomy_Code_7”,“Provider_License_Number_7”,“Provider_License_Number_State_Code_7”,“Healthcare_Provider_Primary_Taxonomy_Switch_7”,“Healthcare_Provider_Taxonomy_Code_8”,“Provider_License_Number_8”,“Provider_License_Number_State_Code_8”,“Healthcare_Provider_Primary_Taxonomy_Switch_8”,“Healthcare_Provider_Taxonomy_Code_9”,“Provider_License_Number_9”,“Provider_License_Number_State_Code_9”,“Healthcare_Provider_Primary_Taxonomy_Switch_9”,“Healthcare_Provider_Taxonomy_Code_10”,“Provider_License_Number_10”,“Provider_License_Number_State_Code_10”,“Healthcare_Provider_Primary_Taxonomy_Switch_10”,“Healthcare_Provider_Taxonomy_Code_11”,“Provider_License_Number_11”,“Provider_License_Number_State_Code_11”,“Healthcare_Provider_Primary_Taxonomy_Switch_11”,“Healthcare_Provider_Taxonomy_Code_12”,“Provider_License_Number_12”,“Provider_License_Number_State_Code_12”,“Healthcare_Provider_Primary_Taxonomy_Switch_12”,“Healthcare_Provider_Taxonomy_Code_13”,“Provider_License_Number_13”,“Provider_License_Number_State_Code_13”,“Healthcare_Provider_Primary_Taxonomy_Switch_13”,“Healthcare_Provider_Taxonomy_Code_14”,“Provider_License_Number_14”,“Provider_License_Number_State_Code_14”,“Healthcare_Provider_Primary_Taxonomy_Switch_14”,“Healthcare_Provider_Taxonom
y_Code_15”,“Provider_License_Number_15”,“Provider_License_Number_State_Code_15”,“Healthcare_Provider_Primary_Taxonomy_Switch_15”,“Other_Provider_Identifier_1”,“Other_Provider_Identifier_Type_Code_1”,“Other_Provider_Identifier_State_1”,“Other_Provider_Identifier_Issuer_1”,“Other_Provider_Identifier_2”,“Other_Provider_Identifier_Type_Code_2”,“Other_Provider_Identifier_State_2”,“Other_Provider_Identifier_Issuer_2”,“Other_Provider_Identifier_3”,“Other_Provider_Identifier_Type_Code_3”,“Other_Provider_Identifier_State_3”,“Other_Provider_Identifier_Issuer_3”,“Other_Provider_Identifier_4”,“Other_Provider_Identifier_Type_Code_4”,“Other_Provider_Identifier_State_4”,“Other_Provider_Identifier_Issuer_4”,“Other_Provider_Identifier_5”,“Other_Provider_Identifier_Type_Code_5”,“Other_Provider_Identifier_State_5”,“Other_Provider_Identifier_Issuer_5”,“Other_Provider_Identifier_6”,“Other_Provider_Identifier_Type_Code_6”,“Other_Provider_Identifier_State_6”,“Other_Provider_Identifier_Issuer_6”,“Other_Provider_Identifier_7”,“Other_Provider_Identifier_Type_Code_7”,“Other_Provider_Identifier_State_7”,“Other_Provider_Identifier_Issuer_7”,“Other_Provider_Identifier_8”,“Other_Provider_Identifier_Type_Code_8”,“Other_Provider_Identifier_State_8”,“Other_Provider_Identifier_Issuer_8”,“Other_Provider_Identifier_9”,“Other_Provider_Identifier_Type_Code_9”,“Other_Provider_Identifier_State_9”,“Other_Provider_Identifier_Issuer_9”,“Other_Provider_Identifier_10”,“Other_Provider_Identifier_Type_Code_10”,“Other_Provider_Identifier_State_10”,“Other_Provider_Identifier_Issuer_10”,“Other_Provider_Identifier_11”,“Other_Provider_Identifier_Type_Code_11”,“Other_Provider_Identifier_State_11”,“Other_Provider_Identifier_Issuer_11”,“Other_Provider_Identifier_12”,“Other_Provider_Identifier_Type_Code_12”,“Other_Provider_Identifier_State_12”,“Other_Provider_Identifier_Issuer_12”,“Other_Provider_Identifier_13”,“Other_Provider_Identifier_Type_Code_13”,“Other_Provider_Identifier_State_13”,“Other_Provider_Identifier_Issuer_
13”,“Other_Provider_Identifier_14”,“Other_Provider_Identifier_Type_Code_14”,“Other_Provider_Identifier_State_14”,“Other_Provider_Identifier_Issuer_14”,“Other_Provider_Identifier_15”,“Other_Provider_Identifier_Type_Code_15”,“Other_Provider_Identifier_State_15”,“Other_Provider_Identifier_Issuer_15”,“Other_Provider_Identifier_16”,“Other_Provider_Identifier_Type_Code_16”,“Other_Provider_Identifier_State_16”,“Other_Provider_Identifier_Issuer_16”,“Other_Provider_Identifier_17”,“Other_Provider_Identifier_Type_Code_17”,“Other_Provider_Identifier_State_17”,“Other_Provider_Identifier_Issuer_17”,“Other_Provider_Identifier_18”,“Other_Provider_Identifier_Type_Code_18”,“Other_Provider_Identifier_State_18”,“Other_Provider_Identifier_Issuer_18”,“Other_Provider_Identifier_19”,“Other_Provider_Identifier_Type_Code_19”,“Other_Provider_Identifier_State_19”,“Other_Provider_Identifier_Issuer_19”,“Other_Provider_Identifier_20”,“Other_Provider_Identifier_Type_Code_20”,“Other_Provider_Identifier_State_20”,“Other_Provider_Identifier_Issuer_20”,“Other_Provider_Identifier_21”,“Other_Provider_Identifier_Type_Code_21”,“Other_Provider_Identifier_State_21”,“Other_Provider_Identifier_Issuer_21”,“Other_Provider_Identifier_22”,“Other_Provider_Identifier_Type_Code_22”,“Other_Provider_Identifier_State_22”,“Other_Provider_Identifier_Issuer_22”,“Other_Provider_Identifier_23”,“Other_Provider_Identifier_Type_Code_23”,“Other_Provider_Identifier_State_23”,“Other_Provider_Identifier_Issuer_23”,“Other_Provider_Identifier_24”,“Other_Provider_Identifier_Type_Code_24”,“Other_Provider_Identifier_State_24”,“Other_Provider_Identifier_Issuer_24”,“Other_Provider_Identifier_25”,“Other_Provider_Identifier_Type_Code_25”,“Other_Provider_Identifier_State_25”,“Other_Provider_Identifier_Issuer_25”,“Other_Provider_Identifier_26”,“Other_Provider_Identifier_Type_Code_26”,“Other_Provider_Identifier_State_26”,“Other_Provider_Identifier_Issuer_26”,“Other_Provider_Identifier_27”,“Other_Provider_Identifier_Type_Code_27”,“Other_Provid
er_Identifier_State_27”,“Other_Provider_Identifier_Issuer_27”,“Other_Provider_Identifier_28”,“Other_Provider_Identifier_Type_Code_28”,“Other_Provider_Identifier_State_28”,“Other_Provider_Identifier_Issuer_28”,“Other_Provider_Identifier_29”,“Other_Provider_Identifier_Type_Code_29”,“Other_Provider_Identifier_State_29”,“Other_Provider_Identifier_Issuer_29”,“Other_Provider_Identifier_30”,“Other_Provider_Identifier_Type_Code_30”,“Other_Provider_Identifier_State_30”,“Other_Provider_Identifier_Issuer_30”,“Other_Provider_Identifier_31”,“Other_Provider_Identifier_Type_Code_31”,“Other_Provider_Identifier_State_31”,“Other_Provider_Identifier_Issuer_31”,“Other_Provider_Identifier_32”,“Other_Provider_Identifier_Type_Code_32”,“Other_Provider_Identifier_State_32”,“Other_Provider_Identifier_Issuer_32”,“Other_Provider_Identifier_33”,“Other_Provider_Identifier_Type_Code_33”,“Other_Provider_Identifier_State_33”,“Other_Provider_Identifier_Issuer_33”,“Other_Provider_Identifier_34”,“Other_Provider_Identifier_Type_Code_34”,“Other_Provider_Identifier_State_34”,“Other_Provider_Identifier_Issuer_34”,“Other_Provider_Identifier_35”,“Other_Provider_Identifier_Type_Code_35”,“Other_Provider_Identifier_State_35”,“Other_Provider_Identifier_Issuer_35”,“Other_Provider_Identifier_36”,“Other_Provider_Identifier_Type_Code_36”,“Other_Provider_Identifier_State_36”,“Other_Provider_Identifier_Issuer_36”,“Other_Provider_Identifier_37”,“Other_Provider_Identifier_Type_Code_37”,“Other_Provider_Identifier_State_37”,“Other_Provider_Identifier_Issuer_37”,“Other_Provider_Identifier_38”,“Other_Provider_Identifier_Type_Code_38”,“Other_Provider_Identifier_State_38”,“Other_Provider_Identifier_Issuer_38”,“Other_Provider_Identifier_39”,“Other_Provider_Identifier_Type_Code_39”,“Other_Provider_Identifier_State_39”,“Other_Provider_Identifier_Issuer_39”,“Other_Provider_Identifier_40”,“Other_Provider_Identifier_Type_Code_40”,“Other_Provider_Identifier_State_40”,“Other_Provider_Identifier_Issuer_40”,“Other_Provider_Identifier_
41”,“Other_Provider_Identifier_Type_Code_41”,“Other_Provider_Identifier_State_41”,“Other_Provider_Identifier_Issuer_41”,“Other_Provider_Identifier_42”,“Other_Provider_Identifier_Type_Code_42”,“Other_Provider_Identifier_State_42”,“Other_Provider_Identifier_Issuer_42”,“Other_Provider_Identifier_43”,“Other_Provider_Identifier_Type_Code_43”,“Other_Provider_Identifier_State_43”,“Other_Provider_Identifier_Issuer_43”,“Other_Provider_Identifier_44”,“Other_Provider_Identifier_Type_Code_44”,“Other_Provider_Identifier_State_44”,“Other_Provider_Identifier_Issuer_44”,“Other_Provider_Identifier_45”,“Other_Provider_Identifier_Type_Code_45”,“Other_Provider_Identifier_State_45”,“Other_Provider_Identifier_Issuer_45”,“Other_Provider_Identifier_46”,“Other_Provider_Identifier_Type_Code_46”,“Other_Provider_Identifier_State_46”,“Other_Provider_Identifier_Issuer_46”,“Other_Provider_Identifier_47”,“Other_Provider_Identifier_Type_Code_47”,“Other_Provider_Identifier_State_47”,“Other_Provider_Identifier_Issuer_47”,“Other_Provider_Identifier_48”,“Other_Provider_Identifier_Type_Code_48”,“Other_Provider_Identifier_State_48”,“Other_Provider_Identifier_Issuer_48”,“Other_Provider_Identifier_49”,“Other_Provider_Identifier_Type_Code_49”,“Other_Provider_Identifier_State_49”,“Other_Provider_Identifier_Issuer_49”,“Other_Provider_Identifier_50”,“Other_Provider_Identifier_Type_Code_50”,“Other_Provider_Identifier_State_50”,“Other_Provider_Identifier_Issuer_50”,“Is_Sole_Proprietor”,“Is_Organization_Subpart”,“Parent_Organization_LBN”,“Parent_Organization_TIN”,“Authorized_Official_Name_Prefix_Text”,“Authorized_Official_Name_Suffix_Text”,“Authorized_Official_Credential_Text”

Hello Jay,
the Wizard can’t extract the metadata, because the line is very long. I’ve reported the problem to our issue tracking system at https://bug.javlin.eu. Please use guest/guest as username/password credentials to see it.

As a workaround you can use the attached graph for metadata extraction. Just copy the line with the field names into the tmp.txt file. As a result you will get a fmt file with the metadata definition. All fields in the file are of type string, so you need to change the types manually to the desired values.
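
For readers who do not have the attached graph, the same idea can be roughly sketched outside Clover. This is not the attached graph, only a minimal Python sketch of what it is described as doing: read the quoted header line and write one string field per column. The function name header_to_fmt, the default record name, and the exact Record/Field attributes are assumptions here, so compare the result against a fmt file that Clover itself produced before relying on it. It also does not check whether Clover accepts every name as is (for example the ones containing ".").

```python
import csv
import io
from xml.sax.saxutils import quoteattr


def header_to_fmt(header_line, record_name="recordName1"):
    """Build a metadata XML string with one string field per header column.

    Hypothetical helper: the attribute set on <Record> and <Field> is an
    assumption about the fmt layout, not taken from the Clover docs.
    """
    # csv.reader strips the straight double quotes around each field name
    names = next(csv.reader(io.StringIO(header_line)))
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<Record name=%s type="delimited" fieldDelimiter="," recordDelimiter="\\n">'
        % quoteattr(record_name),
    ]
    for name in names:
        # Every field is emitted as string; change the types by hand afterwards
        lines.append('  <Field name=%s type="string"/>' % quoteattr(name.strip()))
    lines.append("</Record>")
    return "\n".join(lines) + "\n"


# Example use (assumes tmp.txt holds only the header line):
#   with open("tmp.txt", encoding="ISO-8859-1") as f:
#       fmt = header_to_fmt(f.readline())
#   with open("metadata.fmt", "w", encoding="UTF-8") as f:
#       f.write(fmt)
```

As with the graph, every field comes out as string, so the types still have to be adjusted manually.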

Hi,

And thanks again. I found this limit too last night, and just edited down the column names (with various abbreviations) in the header record until I dropped below the character count limit.

Thanks
Jay