Page 2

Forum Replies Created

Viewing 15 posts - 16 through 30 (of 39 total)

← 1 2 3 →

Author

Posts
July 20, 2018 at 11:56 am in reply to: Need to build an interface for XML to SQL Datadata #7471

Fabian Depry
Moderator

As long as those are tears of joy! 🙂

Good luck with your testing.

July 20, 2018 at 10:50 am in reply to: Need to build an interface for XML to SQL Datadata #7469

Fabian Depry
Moderator

Hi Jeff, You can use the SEER Data Generator to create large NAACCR XML files. Only core data items will have a value (about 40 of 50 of them) so it’s not perfect (for example, it would be nice to have text for Abstracts). But it’s better than nothing 🙂

You can download it from this release page: https://github.com/imsweb/data-generator/releases

Look for the “data-generator-X.X-all.jar” where “X.X” is the latest release.

This a standalone JAR, so you should be able to just download it and double-click it (assuming you have a Java environment installed on your machine).

You can find information about the data generator itself and what variables it computes on the project’s home page: https://github.com/imsweb/data-generator

July 16, 2018 at 12:24 pm in reply to: Using SAS with NAACCR XML #7442

Fabian Depry
Moderator

Darn 🙂

I will take another look at some point.

July 16, 2018 at 12:11 pm in reply to: Using SAS with NAACCR XML #7440

Fabian Depry
Moderator

Awesome! Thanks for testing!

July 12, 2018 at 2:30 pm in reply to: Using SAS with NAACCR XML #7420

Fabian Depry
Moderator

Hi Valerie, the issue you reported should be fixed now, and I actually released a new version of the library (version 4.10). If you happen to re-test this, please make sure to use that new version of the SAS JAR file and not the previous 4.9. Thanks!

July 11, 2018 at 8:16 am in reply to: question about patient/turmor model #7335

Fabian Depry
Moderator

Unfortunately, I agree 🙂

Things are going to be rough for the next few years, but I am convinced in the long run, these changes will help the community moving forward in the right direction.

July 11, 2018 at 8:12 am in reply to: question about patient/turmor model #7333

Fabian Depry
Moderator

To tie your second comment to mine, I would say that your analytic record should be based on the persistence data model (the consolidated view) and not the transmission data model. If the new NAACCR XML data model works for your persistence model, that’s great; if it’s not exactly what you need, then you might need to tweak it and use some type of conversion from one to the other.

July 11, 2018 at 8:04 am in reply to: question about patient/turmor model #7332

Fabian Depry
Moderator

Bruce,

I think your question is related to transmission versus persistence. The purpose of transmission is just to move data around (either from hospitals/labs to the Central Registry, or from the Central Registry to the Standard Setters, etc…). The purpose of persistence is to keep a consolidated view of the data at the Central Registry; I would assume that most Registries if not all use some kind of database for that. Both mechanism have a data model but those models do not have to be the same and in many registries, the persistence data model is more complex than the transmission data model because the Registry wants to keep track of more data than can be transmitted via NAACCR data files.

The primary purpose of the NAACCR XML Exchange standard is to transmit data and its data model was design with that in mind. A Registry is welcome to use the same data model for persistence if that works for them, but if not, they probably need to move to a more complex persistence data model.

Taking the address as an example: the standard defines it as the “current address” and so the transmission data model only allows one (the address at the time the abstract was created). If you want to keep track of the addresses over time in your database, you would have to use a more complex data model where you consolidate all the incoming current addresses into a list of addresses.

I understand having different models is not as convenient than using the same model for transmission and persistence because it requires some type of conversion, but in general those conversion should be fairly simple.

July 9, 2018 at 3:15 pm in reply to: Using SAS with NAACCR XML #7312

Fabian Depry
Moderator

Thanks for testing again! It’s really difficult to properly cover all those corner cases! I really should use a standard Java XML parser, but none of them is still compatible with Java 7 which is required by SAS. They really need to move along and update their Java version!!!

I will take a look at this one soon.

July 9, 2018 at 12:32 pm in reply to: NAACCR Fixed-Width data exchange format going away in 2020 #7310

Fabian Depry
Moderator

Just to clarify, the main side effect of the flat file format going away will be that NAACCR will stop defining start column positions for any data items.

Another side effect will be that the State/Requestor Items field will go away (States and Registries will need to use regular variables defined in a user-defined dictionary instead of plugging those variable into one long text field).

There will probably be other side effects, but I think those are the biggest ones.

July 9, 2018 at 10:58 am in reply to: Using SAS with NAACCR XML #7308

Fabian Depry
Moderator

Hi Valerie, I fixed the issue you reported with the CDATA sections. I re-created the JAR file (with the same 4.9 version still) and re-posted it on GitHub. It would be great if you could confirm the fix is working as expected.

Thank you!

June 26, 2018 at 10:33 am in reply to: Need to build an interface for XML to SQL Datadata #7273

Fabian Depry
Moderator

Hi Jeff,

Another resource you might find useful is the Java implementation of the NAACCR XML specifications: https://github.com/imsweb/naaccr-xml

You probably won’t be able to use any of the code (Java doesn’t play well with C# and vice versa) but you can certainly see many examples and get some ideas.

The project home page contains links to the specifications and to a wiki page. I strongly encourage you to take a look at the wiki, it has all kind of information about NAACCR XML, including a (Java-based) standalone tool allowing you to go from XML to flat and flat to XML, and also a bunch of valid and invalid XML files that can be used to test your implementation.

I hope this helps!

June 20, 2018 at 11:45 am in reply to: Using SAS with NAACCR XML #7227

Fabian Depry
Moderator

I will take another look at some point.

To make this work with SAS, which still requires Java 7 or earlier (which has been end-of-life for several years now), I had to implement my own (simple) parsing logic. So there are things that are not going to be properly parsed. Hopefully I can address them as they are found.

June 18, 2018 at 12:58 pm in reply to: Using SAS with NAACCR XML #7220

Fabian Depry
Moderator

Hi Valerie, I implemented the changes we talked about; do you mind re-trying your example when you get a chance?

Note that I removed the version from the macro filenames; I figured it will be easier for people to just replace the files.

I also re-created the SAS JAR file with a fix for the length issue, I re-posted it under the same name (naaccr-xml-4.9-sas.jar) in the release page of the GitHub project (eventually the version will be increased but I figured this can still be considered the “first” version).

Thanks!

June 14, 2018 at 7:39 pm in reply to: Using SAS with NAACCR XML #7209

Fabian Depry
Moderator

That’s great, thanks for testing that solution!

I will add the replace option, that makes sense.

I assume that the truncating is because SAS only uses a subsets of the rows to determine the max length of a given column when reading CSV. If that’s the issue then I think I have a solution. I will try it soon and post new files.
Author

Posts

Viewing 15 posts - 16 through 30 (of 39 total)

← 1 2 3 →

Fabian Depry

Fabian Depry

Forum Replies Created

Recent Topics