20150413-oddb2xml-with-parslet

Summary

  • Switch parsing composition for --calc in oddb2xml to use parslet

Commits

Index

Keep in Mind for work to do
  • Fix dojo error http://www.sitepen.com/blog/2012/10/31/debugging-dojo-common-error-messages/#forgot-dom-ready
  • On May 27 I removed the tests for fix_registrations, fix_sequences, fix_compositions and fix_packages from test/test_plugin/swissmedic.rb, as I could not find any references to them in the source code. Did I erroneously remove something when cleaning up the swissmedic import earlier?
  • The whole test for older/newer Packages must be adapted to xlsx. One must compare the rows (e.g. by creating csv files) and do the same in xlsx!
  • Create a gem. Task: the input is a file with EAN codes, standard output shows each EAN code plus its ATC code. The source is Swissmedic Packungen.xlsx or XML.
  • Import via data/medreg_companies.yaml
  • Fix problem with radioactivatum 99m-technetio when parsing Wirkstoffe

Switch parsing composition for --calc in oddb2xml to use parslet

Tasks to handle are:

  • Labels with descriptions, e.g. I) DTPa-IPV-Komponente (Suspension):
  • Preparations, e.g. Praeparatio cryodesiccata: virus rabiei inactivatum
  • Substances:
    • with or without a dose
    • with or without a corresponding chemical substance (and optional dose)
  • handle corresp.
  • handle residui:
  • handle stuff like pro/et/ex

At the moment I am stuck on handling "(" in names and isolated digits ('1') in the name, e.g. virus poliomyelitis typus 1 inactivatum (D-Antigen) 2 mg. Here parslet does not behave as I expected.
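One way to think about the problem: the dose is the last "number + unit" pair in the entry, and everything before it, including digits and parentheses, belongs to the name. The sketch below illustrates that idea with a plain Ruby regular expression rather than parslet. The method name split_name_and_dose and the unit list are assumptions made for this illustration and are not taken from oddb2xml.

```ruby
# Split an entry into name and trailing dose.
# Assumption: the dose, if present, is the final "<number> <unit>" of the entry.
# The unit list is a hypothetical subset, not the real dose_unit rule.
def split_name_and_dose(text)
  entry = text.strip
  # The name is matched lazily and the pattern is anchored at the end (\z),
  # so digits and parentheses inside the name stay part of the name.
  pattern = /\A(?<name>.+?)\s+(?<qty>\d+(?:\.\d+)?)\s*(?<unit>guttae|mg|ml|g)\z/
  m = pattern.match(entry)
  # No trailing dose found: the whole entry is the name.
  return { name: entry, qty: nil, unit: nil } unless m

  { name: m[:name], qty: m[:qty].to_f, unit: m[:unit] }
end

entry = 'virus poliomyelitis typus 1 inactivatum (D-Antigen) 2 mg'
result = split_name_and_dose(entry)
puts result[:name] # the name keeps "typus 1" and "(D-Antigen)"
puts result[:unit] # mg
```

The same principle (decide what ends a name instead of listing what a name may contain) should carry over to a parslet rule, but that translation is not shown here.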

It is sometimes confusing why parslet fails to match a simple string such as benzoe 40 guttae while benzoe 40 ml parses perfectly. Reordering the alternatives in the rule dose_unit helped, though it is unclear to me why.
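A likely explanation is that parslet, as a PEG parser, treats | as an ordered choice: the first alternative that matches wins and the later ones are never tried. If a short unit such as g is listed before guttae, the g matches and the remaining uttae makes the overall parse fail. The sketch below imitates this behaviour in plain Ruby with StringScanner. It does not use parslet, and the unit lists and the helper names match_unit and parse_dose are assumptions, since the actual dose_unit rule is not shown on this page.

```ruby
require 'strscan'

# Ordered choice: try the alternatives in the given order and
# commit to the first one that matches at the current position.
def match_unit(scanner, units)
  units.each do |unit|
    return unit if scanner.scan(/#{Regexp.escape(unit)}/)
  end
  nil
end

# Parse "<name> <quantity> <unit>"; the whole input must be consumed.
def parse_dose(text, units)
  scanner = StringScanner.new(text)
  name = scanner.scan(/[a-z]+/) or return nil
  scanner.scan(/\s+/) or return nil
  qty = scanner.scan(/\d+/) or return nil
  scanner.scan(/\s+/) or return nil
  unit = match_unit(scanner, units) or return nil
  return nil unless scanner.eos? # leftover input means the parse failed

  { name: name, qty: qty.to_i, unit: unit }
end

short_first = %w[g mg ml guttae] # hypothetical order that breaks "guttae"
long_first  = %w[guttae mg ml g] # longer alternatives before their prefixes

p parse_dose('benzoe 40 ml', short_first)     # parses, unit "ml"
p parse_dose('benzoe 40 guttae', short_first) # nil: "g" matched, "uttae" is left over
p parse_dose('benzoe 40 guttae', long_first)  # parses, unit "guttae"
```

If this is indeed the cause, listing longer alternatives before any alternative that is a prefix of them keeps the rule stable.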

With the commits

about 75% of the compositions pass (parsed 8937 lines with 2275 errors, i.e. 6662 lines without errors).

Page last modified on April 13, 2015, at 10:17 PM