Earth Sciences / startR / Merge requests / !172

Initialize big.matrix

Merged. Nuria Pérez-Zanón requested to merge develop-bigMem into master on Feb 09, 2022.

Hi @aho!

This branch partially solves the problem.

I installed this branch and ran your minimal example with the following Compute call:

 res_n4 <- Compute(wf,
                   chunks = list(sdate = 1),
                   threads_load = 1,
                   threads_compute = 4,
                   cluster = list(queue_host = 'nord4',
                                  queue_type = 'slurm',
                                  temp_dir = temp_dir,
                                  lib_dir = "/.statelite/tmpfs/gpfs/home/bsc32/bsc32339/R/x86_64-pc-linux-gnu-library/3.6",
                                  cores_per_job = 2,
                                  job_wallclock = '01:00:00',
                                  max_jobs = 4,
                                  bidirectional = FALSE,
                                  polling_period = 10),
                   ecflow_suite_dir = ecflow_suite_dir,
                   wait = TRUE)

The result is correct in this case:

summary(res_n4$output1)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
  244.6   251.4   256.9   259.9   272.3   274.0       8 

However, with threads_load = 2, the same call fails:

* Loading... This may take several minutes...
* Progress: 0%Error in parallel::clusterApplyLB(cluster, work_pieces, .LoadDataFile,  :
  object 'transform_crop_domain' not found
In addition: Warning messages:
1: ! Warning: Parameter 'pattern_dims' not specified. Taking the first dimension,
!   'dat' as 'pattern_dims'.
2: ! Warning: Could not find any pattern dim with explicit data set descriptions (in
!   the form of list of lists). Taking the first pattern dim, 'dat', as
!   dimension with pattern specifications.
3: In filebacked.big.matrix(nrow = nrow, ncol = ncol, type = type,  :
  No descriptor file given, it will be named _2842325790_1_1_1_1_1_1_.desc

The data cannot be loaded.
See the original error message:
object 'found_files' not found

Well, I'll keep testing it.

Cheers,

Núria

Source branch: develop-bigMem